Abstract

We investigate nonparametric estimation of a monotone baseline hazard and a decreasing
baseline density within the Cox model. Two estimators of a nondecreasing baseline
hazard function are proposed. We derive the nonparametric maximum likelihood estimator
and consider a Grenander type estimator, defined as the left-hand slope of the
greatest convex minorant of the Breslow estimator [1]. We demonstrate that the two
estimators are strongly consistent and asymptotically equivalent and derive their common
limit distribution at a fixed point. Both estimators of a nonincreasing baseline hazard
and their asymptotic properties are obtained in a similar manner. Furthermore, we
introduce a Grenander type estimator for a nonincreasing baseline density, defined as the
left-hand slope of the least concave majorant of an estimator of the baseline cumulative
distribution function, derived from the Breslow estimator. We show that this estimator
is strongly consistent and derive its asymptotic distribution at a fixed point.

Keywords: Cox model, Breslow estimator, greatest convex minorant, nonparametric maximum
likelihood, cube-root asymptotics, empirical processes

1 Introduction 

Shape constrained nonparametric estimation dates back to the 1950s. The milestone
paper of Grenander [7] introduced the maximum likelihood estimator of a nonincreasing
density, while Prakasa Rao [16] derived its asymptotic distribution at a fixed point. Similarly,
the maximum likelihood estimator of a monotone hazard function was proposed by
Marshall and Proschan [14] and its asymptotic distribution was determined in [17]. Other
estimators have been proposed, but despite the high interest and applicability, the difficulty
in deriving the asymptotics was a major drawback. Shape constrained estimation
was revived by Groeneboom [8], who proposed an alternative to Prakasa Rao's cumbersome
type of proof. Groeneboom's approach employs a so-called inverse process and makes use
of a Hungarian embedding or a KMT construction. Once such an embedding is available,
it enables the derivation of the asymptotic distribution of the considered estimator. This is
the case, for example, when estimating a monotone density or hazard function from
right-censored observations, as proposed by Huang and Zhang [11] and Huang and Wellner [10].
Another setting for deriving the asymptotic distribution, which does not require a Hungarian
embedding, was later provided by the limit theorems in [12]. Their cube root asymptotics are
based on a functional limit theorem for empirical processes.



The present paper treats the estimation of a monotone baseline hazard and a decreasing
baseline density in the Cox model. Ever since the model was introduced (see [1]) and in
particular, since the asymptotic properties of the proposed estimators were first derived by
Tsiatis [21], the Cox model has been the classical survival analysis framework for incorporating
covariates in the study of a lifetime distribution. The hazard function is of particular interest
in survival analysis, as it represents an important feature of the time course of a process under 
study, e.g., death or a certain disease. The main reason lies in its ease of interpretation and 
in the fact that the hazard function takes into account ageing, while, for example, the density 
function does not. Times to death, infection or development of a disease of interest in most 
survival analysis studies are observed to have a nondecreasing baseline hazard. Nevertheless, 
the survival time after a successful medical treatment is usually modeled using a nonincreasing 
hazard. An example of nonincreasing hazard is presented in Cook et al. [3], where the authors 
concluded that the daily risk of pneumonia decreases with increasing duration of stay in the 
intensive care unit. 

Chung and Chang [2] consider a maximum likelihood estimator of a nondecreasing baseline
hazard function in the Cox model, adopting the convention that each censoring time is equal
to its preceding observed survival time. They prove consistency, but no distributional theory
is available. We consider a maximum likelihood estimator $\hat\lambda_n$ of a monotone baseline hazard
function, which imposes no extra assumption on the censoring times. This estimator differs
from the one in [2] and has a higher likelihood. Furthermore, we introduce a Grenander
type estimator for a monotone baseline hazard function based on the well-known baseline
cumulative hazard estimator, the Breslow estimator $\Lambda_n$. The nondecreasing baseline hazard
estimator $\tilde\lambda_n$ is defined as the left-hand slope of the greatest convex minorant (GCM) of $\Lambda_n$.
Similarly, a nonincreasing baseline hazard estimator is characterized as the left-hand slope of the
least concave majorant (LCM) of $\Lambda_n$. It is noteworthy that, just as in the no covariates
case (see [10]), the two monotone estimators are different, but are shown to be asymptotically
equivalent. Additionally, we introduce a nonparametric estimator for a nonincreasing baseline
density. An estimator $F_n$ for the baseline distribution function is based on the Breslow
estimator, and the baseline density estimator $\tilde f_n$ is then defined as the left-hand slope of the
LCM of $F_n$. The treatment of the maximum likelihood estimator for a nonincreasing baseline
density is much more complex and is deferred to another paper. For the remaining three
estimators, we show that they converge at rate $n^{1/3}$ and we establish their limit distribution.
Since, to the authors' best knowledge, there does not exist a Hungarian embedding for the
Breslow estimator, our results are based on the theory in [12] and an argmax continuous
mapping theorem in [10].

The paper is organized as follows. In Section 2 we introduce the model and state our
assumptions. The formal characterization of the maximum likelihood estimator $\hat\lambda_n$ is given in
Lemmas 2.1 and 2.2. Our main results concerning the asymptotic properties of the proposed
estimators are gathered in Section 3. Section 4 is devoted to the proofs of strong consistency
and to establishing strong uniform consistency of the Breslow estimator and the corresponding
estimator $F_n$ of the baseline cumulative distribution function. In order to prepare the
application of results from [12], in Section 5 we introduce the inverses of the estimators in
terms of minima and maxima of random processes and obtain the limit of these processes.
Finally, in Section 6 we derive the asymptotic distribution of the estimators at a fixed point.
2 Definitions and Assumptions



Let the observed data consist of independent identically distributed triplets $(T_i, \Delta_i, Z_i)$,
with $i = 1, 2, \ldots, n$, where $T_i$ denotes the follow-up time, with a corresponding censoring
indicator $\Delta_i$ and covariate vector $Z_i \in \mathbb{R}^p$. A generic follow-up time is defined by
$T = \min(X, C)$, where $X$ represents the survival time and $C$ is the censoring time. Accordingly,
$\Delta = \{X \le C\}$, where $\{\cdot\}$ denotes the indicator function. The survival time $X$ and censoring
time $C$ are assumed to be conditionally independent given $Z$, that is to say that the censoring
mechanism is non-informative. The covariate vector $Z \in \mathbb{R}^p$ is assumed to be time invariant
and non-degenerate.

Within the Cox model, the distribution of the survival time is related to the corresponding
covariate by

(2.1) $\lambda(x|z) = \lambda_0(x)\, e^{\beta_0' z}$,

where $\lambda(x|z)$ is the hazard function for an individual with covariate vector $z \in \mathbb{R}^p$, $\lambda_0$
represents the baseline hazard function and $\beta_0 \in \mathbb{R}^p$ is the vector of the underlying regression
coefficients. Conditionally on $Z = z$, the survival time $X$ is assumed to be a nonnegative
random variable with an absolutely continuous distribution function $F(x|z)$ with density $f(x|z)$.
The same assumptions hold for the censoring variable $C$ and its distribution function $G$.
The distribution function of the follow-up time $T$ is denoted by $H$. We will assume the
following conditions, which are commonly employed in deriving large sample properties of Cox
proportional hazards estimators (e.g., see [21]).

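(A1) Let $\tau_F$, $\tau_G$ and $\tau_H$ be the end points of the support of $F$, $G$ and $H$ respectively. Then

$\tau_H = \tau_G < \tau_F$.

(A2) There exists $\varepsilon > 0$ such that

$\sup_{|\beta - \beta_0| \le \varepsilon} E\left[ |Z|^2 e^{2\beta' Z} \right] < \infty$,

where $|\cdot|$ denotes the Euclidean norm.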
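To make the sampling scheme concrete, the following sketch simulates triplets $(T_i, \Delta_i, Z_i)$ from model (2.1). All specific choices are illustrative assumptions that do not come from the text: a scalar standard normal covariate, baseline hazard $\lambda_0(x) = 2x$ (so that $\Lambda_0(x) = x^2$), $\beta_0 = 0.5$, and censoring times uniform on $[0, 2]$, which gives $\tau_H = \tau_G = 2 < \tau_F = \infty$ in line with (A1). The function name `simulate_cox` is hypothetical.

```python
import math
import random


def simulate_cox(n, beta0=0.5, c_max=2.0, seed=0):
    """Draw n triplets (T, Delta, Z) under an assumed Cox model.

    Assumptions (illustrative only): Z ~ N(0, 1), Lambda_0(x) = x**2,
    C ~ Uniform(0, c_max), and X and C independent given Z.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        u = 1.0 - rng.random()                    # uniform on (0, 1]
        # Invert S(x | z) = exp(-x**2 * exp(beta0 * z)) at level u.
        x = math.sqrt(-math.log(u) / math.exp(beta0 * z))
        c = rng.uniform(0.0, c_max)
        t = min(x, c)                             # follow-up time T = min(X, C)
        delta = 1 if x <= c else 0                # censoring indicator {X <= C}
        data.append((t, delta, z))
    return data
```

Each returned tuple plays the role of one observation $(T_i, \Delta_i, Z_i)$; only these three quantities, not $X$ and $C$ separately, are available to the estimators below.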



2.1 Increasing baseline hazard 



Let $\Lambda(x|z) = -\log(1 - F(x|z))$ be the cumulative hazard function. Then, from (2.1)
it follows that $\Lambda(x|z) = \Lambda_0(x) \exp(\beta_0' z)$, where $\Lambda_0(x) = \int_0^x \lambda_0(u)\, du$ denotes the baseline
cumulative hazard function. Using this together with the relation $\lambda = f/(1 - F)$, the full
loglikelihood can be written as

(2.2) $L_{\beta_0}(\lambda_0) = \sum_{i=1}^{n} \left[ \Delta_i \log \lambda_0(T_i) + \Delta_i \beta_0' Z_i - e^{\beta_0' Z_i} \Lambda_0(T_i) \right]$.

For $\beta_0 \in \mathbb{R}^p$ fixed, we first consider maximum likelihood estimation for a nondecreasing $\lambda_0$.
This requires maximization of (2.2) over all nondecreasing $\lambda_0$. Let $T_{(1)} < T_{(2)} < \cdots < T_{(n)}$ be
the ordered follow-up times and, for $i = 1, 2, \ldots, n$, let $\Delta_{(i)}$ and $Z_{(i)}$ be the censoring indicator
and covariate vector corresponding to $T_{(i)}$. Similar to [13] and Section 7.4 in [19], since $\lambda_0(T_{(n)})$
can be chosen arbitrarily large, we first consider maximization over nondecreasing $\lambda_0$ bounded
by some $M > 0$. One can then argue that the solution is an increasing step function, that
is zero for $x < T_{(1)}$, constant on $[T_{(i)}, T_{(i+1)})$, for $i = 1, 2, \ldots, n-1$, and equal to $M$, for
$x \ge T_{(n)}$. Consequently, for $\beta \in \mathbb{R}^p$ fixed, the loglikelihood reduces to



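(2.3) $L_\beta(\lambda_0) = \sum_{i=1}^{n-1} \Delta_{(i)} \log \lambda_0(T_{(i)}) - \sum_{i=2}^{n} e^{\beta' Z_{(i)}} \sum_{j=1}^{i-1} \left( T_{(j+1)} - T_{(j)} \right) \lambda_0(T_{(j)})$

$= \sum_{i=1}^{n-1} \left\{ \Delta_{(i)} \log \lambda_0(T_{(i)}) - \lambda_0(T_{(i)}) \left( T_{(i+1)} - T_{(i)} \right) \sum_{l=i+1}^{n} e^{\beta' Z_{(l)}} \right\}$.

Maximization over $0 \le \lambda_0(T_{(1)}) \le \cdots \le \lambda_0(T_{(n-1)}) \le M$ will then have a solution $\hat\lambda_n^M(x; \beta)$
and by letting $M \to \infty$, we obtain the NPMLE $\hat\lambda_n(x; \beta)$ for $\lambda_0$. Its characterization can be
described by means of the processes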



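(2.4) $W_n(\beta, x) = \int \left( e^{\beta' z} \int_0^x \{u \ge s\}\, ds \right) d\mathbb{P}_n(u, \delta, z)$,

and

(2.5) $V_n(x) = \int \delta \{u < x\}\, d\mathbb{P}_n(u, \delta, z)$,

with $\beta \in \mathbb{R}^p$ and $x \ge 0$, where $\mathbb{P}_n$ is the empirical measure of the $(T_i, \Delta_i, Z_i)$, and is given by
the following lemma.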



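LEMMA 2.1. For a fixed $\beta \in \mathbb{R}^p$, let $W_n$ and $V_n$ be defined in (2.4) and (2.5). Then, the
NPMLE $\hat\lambda_n(x; \beta)$ of a nondecreasing baseline hazard function $\lambda_0$ is of the form

$\hat\lambda_n(x; \beta) = \begin{cases} 0 & x < T_{(1)}, \\ \hat\lambda_i & T_{(i)} \le x < T_{(i+1)}, \text{ for } i = 1, 2, \ldots, n-1, \\ \infty & x \ge T_{(n)}, \end{cases}$

where $\hat\lambda_i$ is the left derivative of the greatest convex minorant at the point $P_i$ of the cumulative
sum diagram consisting of the points

$P_j = \left( W_n(\beta, T_{(j+1)}) - W_n(\beta, T_{(1)}),\; V_n(T_{(j+1)}) \right)$,

for $j = 1, 2, \ldots, n-1$ and $P_0 = (0, 0)$. Furthermore,

(2.6) $\hat\lambda_i = \max_{1 \le s \le i} \min_{i \le t \le n-1} \frac{\sum_{j=s}^{t} \Delta_{(j)}}{\sum_{j=s}^{t} \left[ T_{(j+1)} - T_{(j)} \right] \sum_{l=j+1}^{n} e^{\beta' Z_{(l)}}}$,

for $i = 1, 2, \ldots, n-1$.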
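The characterization in Lemma 2.1 can be checked numerically on small samples. The sketch below computes the values $\hat\lambda_1, \ldots, \hat\lambda_{n-1}$ for a fixed $\beta$ in two ways: by pooling adjacent violators on the increments of the cumulative sum diagram, and by brute-force evaluation of the max-min formula (2.6). It assumes a scalar covariate and distinct follow-up times; the function names are hypothetical and the code is an illustration, not the authors' implementation.

```python
import math


def csd_increments(data, beta):
    """Return (d, w) with d[i] = Delta_(i) and
    w[i] = [T_(i+1) - T_(i)] * sum_{l > i} exp(beta * Z_(l)), for i = 1..n-1."""
    obs = sorted(data)                       # order by follow-up time
    n = len(obs)
    tail = [0.0] * (n + 1)                   # tail[i] = sum of exp(beta * z) over obs[i:]
    for i in range(n - 1, -1, -1):
        tail[i] = tail[i + 1] + math.exp(beta * obs[i][2])
    d = [obs[i][1] for i in range(n - 1)]
    w = [(obs[i + 1][0] - obs[i][0]) * tail[i + 1] for i in range(n - 1)]
    return d, w


def npmle_pava(d, w):
    """Left slopes of the GCM of the cumulative sum diagram via pool-adjacent-violators."""
    blocks = []                              # each block: [sum of d, sum of w, count]
    for di, wi in zip(d, w):
        blocks.append([float(di), wi, 1])
        # Merge while the previous block has a strictly larger slope than the last one.
        while len(blocks) > 1 and blocks[-2][0] * blocks[-1][1] > blocks[-1][0] * blocks[-2][1]:
            sd, sw, cnt = blocks.pop()
            blocks[-1][0] += sd
            blocks[-1][1] += sw
            blocks[-1][2] += cnt
    values = []
    for sd, sw, cnt in blocks:
        values.extend([sd / sw] * cnt)
    return values


def npmle_maxmin(d, w):
    """Direct evaluation of the max-min formula (2.6); cubic cost, for checking only."""
    m = len(d)

    def ratio(s, t):
        return sum(d[s:t + 1]) / sum(w[s:t + 1])

    return [max(min(ratio(s, t) for t in range(i, m)) for s in range(i + 1))
            for i in range(m)]
```

For the toy sample `[(1.0, 1, 0.0), (2.0, 0, 0.0), (3.0, 1, 0.0), (4.0, 1, 0.0)]` with `beta = 0.0`, both routines return `[0.2, 0.2, 1.0]`: the censored observation at time 2 is pooled with the event at time 1. The common factor $1/n$ in the diagram cancels in the slopes and is therefore omitted.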



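Proof. First, notice that the loglikelihood function in (2.3) can also be written as

(2.7) $\sum_{i=1}^{n-1} \left\{ g_i \log \lambda_0(T_{(i)}) - \lambda_0(T_{(i)}) \right\} w_i$,

where, for $i = 1, 2, \ldots, n-1$,

$w_i = \left[ T_{(i+1)} - T_{(i)} \right] \sum_{l=i+1}^{n} e^{\beta' Z_{(l)}}$ and $g_i = \frac{\Delta_{(i)}}{w_i}$.

As mentioned beforehand, we first maximize over nondecreasing $\lambda_0$ bounded by some $M$.
Since $M$ can be chosen arbitrarily large, the problem of maximizing (2.7) over
$0 \le \lambda_0(T_{(1)}) \le \cdots \le \lambda_0(T_{(n-1)}) \le M$ can be identified with the problem solved in Example 1.5.7 in [19]. The
existence and uniqueness of $\hat\lambda_n^M$ is therefore immediate and is given by

$\hat\lambda_n^M(x; \beta) = \begin{cases} 0 & x < T_{(1)}, \\ \hat\lambda_i & T_{(i)} \le x < T_{(i+1)}, \text{ for } i = 1, 2, \ldots, n-1, \\ M & x \ge T_{(n)}, \end{cases}$

where, as a result of Theorems 1.5.1 and 1.2.1 in [19], the value $\hat\lambda_i$ is the left derivative at $P_i$
of the GCM of the cumulative sum diagram (CSD) consisting of the points

$P_j = \left( \frac{1}{n} \sum_{i=1}^{j} w_i,\; \frac{1}{n} \sum_{i=1}^{j} w_i g_i \right)$, $j = 1, 2, \ldots, n-1$,

and $P_0 = (0, 0)$. It follows that

$\frac{1}{n} \sum_{i=1}^{j} w_i = \frac{1}{n} \sum_{i=1}^{j} \left[ T_{(i+1)} - T_{(i)} \right] \sum_{l=i+1}^{n} e^{\beta' Z_{(l)}} = W_n(\beta, T_{(j+1)}) - W_n(\beta, T_{(1)})$.

For the y-coordinate of the CSD, notice that

$\frac{1}{n} \sum_{i=1}^{j} w_i g_i = \frac{1}{n} \sum_{i=1}^{j} \Delta_{(i)} = V_n(T_{(j+1)})$.

By letting $M \to \infty$, we obtain the NPMLE $\hat\lambda_n(x; \beta)$ for $\lambda_0$. The max-min formula in (2.6)
follows from Theorem 1.4.4 in [19]. □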



REMARK 2.1. One can argue that the maximizer of (2.2) must be constant between successive
uncensored follow-up times. From the characterization given in Lemma 2.1, it can
be seen that the GCM of the CSD only changes slope at points corresponding to uncensored
observations, which means that $\hat\lambda_n(x; \beta)$ is constant between successive uncensored follow-up
times. Moreover, similar to the reasoning in the proof of Lemma 2.1, it follows that $\hat\lambda_n(x; \beta)$
maximizes (2.2). The reason to provide the characterization in Lemma 2.1 in terms of all
follow-up times is that this facilitates the treatment of the asymptotics for this estimator.

In practice, one also has to estimate $\beta_0$. Since the maximum partial likelihood estimator $\hat\beta_n$
for $\beta_0$ is asymptotically efficient under mild conditions and because the amount of information
on $\beta_0$ lost through lack of knowledge of $\lambda_0$ is usually small (see e.g., [6], [15], [20]), we do not
pursue joint maximization of (2.2) over nondecreasing $\lambda_0$ and $\beta_0$. We simply replace $\beta$ in
$\hat\lambda_n(x; \beta)$ by $\hat\beta_n$, and we propose $\hat\lambda_n(x) = \hat\lambda_n(x; \hat\beta_n)$ as our estimator for $\lambda_0$.

Note that $\hat\lambda_n$ is different from the estimator derived in [2], where each censoring time is
taken equal to the preceding observed survival time. This leads to a CSD that is slightly
different from the one in Lemma 2.1. However, it can be shown that both estimators have
the same asymptotic behavior. Furthermore, note that if we take all covariates equal to
zero, the model coincides with the ordinary random censorship model with a nondecreasing
hazard function as considered in [10]. The characterization in Lemma 2.1, with all $Z_i = 0$,
differs slightly from the one in Theorem 3.2 in [10]. Their estimator seems to be the result
of maximization of (2.2) over left-continuous $\lambda_0$ that are constant between follow-up times.
Although this estimator does not maximize (2.2) over all nondecreasing $\lambda_0$, the asymptotic
distribution will turn out to be the same as that of $\hat\lambda_n$, for the special case of no covariates.

Another possibility to estimate a nondecreasing hazard is to construct a Grenander type
estimator, i.e., consider an unconstrained estimator $\Lambda_n$ for the cumulative hazard $\Lambda_0$ and take
the left derivative of the GCM as an estimator of $\lambda_0$. Several isotonic estimators are of this
form (see e.g., [3 [H [10], [5]). Let $X_{(1)} < X_{(2)} < \cdots < X_{(m)}$ denote the ordered, observed
survival times. Breslow [1] proposed

(2.8) $\Lambda_n(x) = \sum_{i : X_{(i)} \le x} \frac{d_i}{\sum_{j=1}^{n} \{T_j \ge X_{(i)}\}\, e^{\hat\beta_n' Z_j}}$,

as an estimator for $\Lambda_0$, where $d_i$ is the number of events at $X_{(i)}$ and $\hat\beta_n$ is the maximum partial
likelihood estimator of the regression coefficients. The estimator $\Lambda_n$ is most commonly referred
to as the Breslow estimator. Following standard derivations, it can be inferred that

(2.9) $\lambda_0(x) = \frac{dH^{uc}(x)/dx}{E\left[ \{T \ge x\} \exp(\beta_0' Z) \right]}$,

where $H^{uc}(x) = P(T \le x, \Delta = 1)$ is the sub-distribution function of the uncensored observations.
Consequently, it can be derived that

(2.10) $\Lambda_0(x) = \int \frac{\delta \{u \le x\}}{E\left[ \{T \ge u\} \exp(\beta_0' Z) \right]}\, dP(u, \delta, z)$,

where $P$ is the underlying probability measure corresponding to the distribution of $(T, \Delta, Z)$.
Note that from (A1), it follows that $\Lambda_0(\tau_H) < \infty$. In view of the above expression, an intuitive
baseline cumulative hazard estimator is obtained by replacing the expectations in (2.10) by
averages and by plugging in $\hat\beta_n$, which yields exactly the Breslow estimator in (2.8). As a
Grenander type estimator for a nondecreasing hazard, we propose the left derivative $\tilde\lambda_n$ of
the greatest convex minorant $\tilde\Lambda_n$ of $\Lambda_n$. This estimator is different from $\hat\lambda_n$ for finite samples,
but we will show that both estimators are asymptotically equivalent. For the special case of
no covariates, this coincides with the results in [10].
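As an illustration of the construction just described, the sketch below evaluates the Breslow estimator (2.8) at the distinct event times and then takes slopes of a greatest convex minorant. It rests on several assumptions that are not part of the text: the covariate is scalar, the value of $\hat\beta_n$ is supplied by the caller rather than computed, and the GCM of the right-continuous step function $\Lambda_n$ on $[0, T_{(n)}]$ is taken to be the lower convex hull of the origin, the left limits of $\Lambda_n$ at the event times, and the right end point. The function names are hypothetical.

```python
import math


def breslow(data, beta):
    """Breslow estimate (x, Lambda_n(x)) at each distinct event time x, for a given beta."""
    events = sorted({t for t, d, _ in data if d == 1})
    cum, steps = 0.0, []
    for x in events:
        d_i = sum(1 for t, d, _ in data if d == 1 and t == x)          # events at x
        risk = sum(math.exp(beta * z) for t, _, z in data if t >= x)   # risk set sum
        cum += d_i / risk
        steps.append((x, cum))
    return steps


def gcm_slopes(points):
    """Segments (left, right, slope) of the lower convex hull of points
    given with strictly increasing x-coordinates."""
    hull = []
    for p in points:
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # Drop the middle point if it lies on or above the chord from hull[-2] to p.
            if (y2 - y1) * (p[0] - x1) >= (p[1] - y1) * (x2 - x1):
                hull.pop()
            else:
                break
        hull.append(p)
    return [(hull[k][0], hull[k + 1][0],
             (hull[k + 1][1] - hull[k][1]) / (hull[k + 1][0] - hull[k][0]))
            for k in range(len(hull) - 1)]


def grenander_hazard(data, beta):
    """Slopes of the GCM of the Breslow estimator on [0, largest follow-up time]."""
    steps = breslow(data, beta)
    tau = max(t for t, _, _ in data)
    points, prev = [(0.0, 0.0)], 0.0
    for x, value in steps:
        if x < tau:
            points.append((x, prev))     # left limit of Lambda_n at the jump point x
        prev = value
    points.append((tau, prev))           # value of Lambda_n at the right end point
    return gcm_slopes(points)
```

On each returned segment `(left, right, slope)` the sketch assigns the value `slope` to the hazard estimate for `left < x <= right`, which matches the left-derivative convention. For the toy sample `[(1.0, 1, 0.0), (2.0, 0, 0.0), (3.0, 1, 0.0), (4.0, 1, 0.0)]` with `beta = 0.0` the slopes are 0, 0.125 and 1.5.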

2.2 Decreasing baseline hazard 

A completely similar characterization is provided for the NPMLE of a nonincreasing
baseline hazard function. As in the nondecreasing case, one can argue that the
loglikelihood is maximized by a decreasing step function that is constant on $(T_{(i-1)}, T_{(i)}]$, for
$i = 1, 2, \ldots, n$, where $T_{(0)} = 0$. In this case, the loglikelihood reduces to

$L_\beta(\lambda_0) = \sum_{i=1}^{n} \left\{ \Delta_{(i)} \log \lambda_0(T_{(i)}) - \lambda_0(T_{(i)}) \left( T_{(i)} - T_{(i-1)} \right) \sum_{l=i}^{n} e^{\beta' Z_{(l)}} \right\}$,

which is maximized over all $\lambda_0(T_{(1)}) \ge \cdots \ge \lambda_0(T_{(n)}) \ge 0$. The solution is characterized by
the following lemma. The proof of this lemma is completely similar to that of Lemma 2.1.

LEMMA 2.2. For a fixed $\beta \in \mathbb{R}^p$, let $W_n$ be defined in (2.4) and let

(2.11) $Y_n(x) = \int \delta \{u \le x\}\, d\mathbb{P}_n(u, \delta, z)$.

Then the NPMLE $\hat\lambda_n(x; \beta)$ of a nonincreasing baseline hazard function $\lambda_0$ is given by

$\hat\lambda_n(x; \beta) = \hat\lambda_i$ for $x \in (T_{(i-1)}, T_{(i)}]$,

for $i = 1, 2, \ldots, n$, where $\hat\lambda_i$ is the left derivative of the least concave majorant (LCM) at the
point $P_i$ of the cumulative sum diagram consisting of the points

$P_j = \left( W_n(\beta, T_{(j)}),\; Y_n(T_{(j)}) \right)$,

for $j = 1, 2, \ldots, n$ and $P_0 = (0, 0)$. Furthermore,

$\hat\lambda_i = \min_{1 \le s \le i} \max_{i \le t \le n} \frac{\sum_{j=s}^{t} \Delta_{(j)}}{\sum_{j=s}^{t} \left[ T_{(j)} - T_{(j-1)} \right] \sum_{l=j}^{n} e^{\beta' Z_{(l)}}}$,

for $i = 1, 2, \ldots, n$.

As before, we propose $\hat\lambda_n(x) = \hat\lambda_n(x; \hat\beta_n)$ as an estimator for $\lambda_0$, where $\hat\beta_n$ denotes the
maximum partial likelihood estimator for $\beta_0$. Similar to the nondecreasing case, the Grenander
type estimator $\tilde\lambda_n$ for a nonincreasing $\lambda_0$ is defined as the left-hand slope of the LCM of
the Breslow estimator $\Lambda_n$, defined in (2.8).



2.3 Decreasing baseline density 

Suppose one is interested in estimating a nonincreasing baseline density $f_0$. In this case,
the corresponding baseline distribution function $F_0$ is concave and it relates to the baseline
cumulative hazard function $\Lambda_0$ as follows

(2.12) $F_0(x) = 1 - e^{-\Lambda_0(x)}$.

Hence, a natural estimator of the baseline distribution function is

(2.13) $F_n(x) = 1 - e^{-\Lambda_n(x)}$,

where $\Lambda_n$ is the Breslow estimator, defined in (2.8). A Grenander type estimator $\tilde f_n$ of a
nonincreasing baseline density is defined as the left-hand slope of the LCM of $F_n$.
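The two steps above, the transformation (2.13) followed by a least concave majorant, can be sketched as follows. The input is assumed to be a list of pairs $(x, \Lambda_n(x))$ at the distinct event times, for instance produced by some Breslow routine; the function names are hypothetical, and the LCM is computed as the upper concave hull of the origin and the points $(x, F_n(x))$, which is an assumed convention for the right-continuous step function $F_n$.

```python
import math


def lcm_slopes(points):
    """Segments (left, right, slope) of the upper concave hull of points
    given with strictly increasing x-coordinates."""
    hull = []
    for p in points:
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # Drop the middle point if it lies on or below the chord from hull[-2] to p.
            if (y2 - y1) * (p[0] - x1) <= (p[1] - y1) * (x2 - x1):
                hull.pop()
            else:
                break
        hull.append(p)
    return [(hull[k][0], hull[k + 1][0],
             (hull[k + 1][1] - hull[k][1]) / (hull[k + 1][0] - hull[k][0]))
            for k in range(len(hull) - 1)]


def grenander_density(cum_hazard_steps):
    """Slopes of the LCM of F_n = 1 - exp(-Lambda_n), given (x, Lambda_n(x)) pairs."""
    points = [(0.0, 0.0)]
    for x, lam in cum_hazard_steps:
        points.append((x, 1.0 - math.exp(-lam)))   # relation (2.13)
    return lcm_slopes(points)
```

For example, `grenander_density([(1.0, 0.25), (3.0, 0.75), (4.0, 1.75)])` returns two segments with nonincreasing slopes: the point at $x = 3$ lies below the chord joining its neighbours and is therefore absorbed into the majorant.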

The derivation of the NPMLE for $f_0$ is much more complex than that of the previous estimators
and its treatment is postponed to a future manuscript. In the special case of no covariates,
the NPMLE $\hat f_n$ was first derived in [11]. In [10] a different characterization has been
provided for $\hat f_n$ in terms of a self-induced cusum diagram and it was shown that $\hat f_n$ and $\tilde f_n$
are asymptotically equivalent.
3 Main Results



In this section, we state our main results. The proofs are postponed to subsequent sections. 
The next theorem provides the consistency of the proposed estimators. 

THEOREM 3.1. Assume that (A1) and (A2) hold.

(i) Suppose that $\lambda_0$ is nondecreasing on $[0, \infty)$ and let $\hat\lambda_n$ and $\tilde\lambda_n$ be the estimators defined
in Section 2.1. Then, for any $x_0 \in (0, \tau_H)$,

$\lambda_0(x_0-) \le \liminf_{n\to\infty} \hat\lambda_n(x_0) \le \limsup_{n\to\infty} \hat\lambda_n(x_0) \le \lambda_0(x_0+)$,

$\lambda_0(x_0-) \le \liminf_{n\to\infty} \tilde\lambda_n(x_0) \le \limsup_{n\to\infty} \tilde\lambda_n(x_0) \le \lambda_0(x_0+)$,

with probability one, where the values $\lambda_0(x_0-)$ and $\lambda_0(x_0+)$ denote the left and right
limit at $x_0$.

(ii) Suppose that $\lambda_0$ is nonincreasing on $[0, \infty)$ and let $\hat\lambda_n$ and $\tilde\lambda_n$ be the estimators defined
in Section 2.2. Then, for any $x_0 \in (0, \tau_H)$,

$\lambda_0(x_0+) \le \liminf_{n\to\infty} \hat\lambda_n(x_0) \le \limsup_{n\to\infty} \hat\lambda_n(x_0) \le \lambda_0(x_0-)$,

$\lambda_0(x_0+) \le \liminf_{n\to\infty} \tilde\lambda_n(x_0) \le \limsup_{n\to\infty} \tilde\lambda_n(x_0) \le \lambda_0(x_0-)$,

with probability one.

(iii) Suppose that $f_0$ is nonincreasing on $[0, \infty)$ and let $\tilde f_n$ be the estimator defined in
Section 2.3. Then, for any $x_0 \in (0, \tau_H)$,

$f_0(x_0+) \le \liminf_{n\to\infty} \tilde f_n(x_0) \le \limsup_{n\to\infty} \tilde f_n(x_0) \le f_0(x_0-)$,

with probability one, where $f_0(x_0-)$ and $f_0(x_0+)$ denote the left and right limit at $x_0$.

The following two theorems yield the asymptotic distribution of the monotone constrained
baseline hazard estimators. In order to keep notations compact, it becomes useful to introduce

(3.1) $\Phi(\beta, x) = \int \{u \ge x\}\, e^{\beta' z}\, dP(u, \delta, z)$,

for $\beta \in \mathbb{R}^p$ and $x \in \mathbb{R}$, where $P$ is the underlying probability measure corresponding to the
distribution of $(T, \Delta, Z)$. Furthermore, by the argmin function we mean the supremum of
times at which the minimum is attained.

THEOREM 3.2. Assume (A1) and (A2) and let $x_0 \in (0, \tau_H)$. Suppose that $\lambda_0$ is nondecreasing
on $[0, \infty)$ and continuously differentiable in a neighborhood of $x_0$, with $\lambda_0(x_0) \ne 0$
and $\lambda_0'(x_0) > 0$. Moreover, suppose that $H^{uc}(x)$ and $x \mapsto \Phi(\beta_0, x)$ are continuously differentiable
in a neighborhood of $x_0$, where $H^{uc}$ is defined below (2.9) and $\Phi$ is defined in (3.1).
Let $\hat\lambda_n$ and $\tilde\lambda_n$ be the estimators defined in Section 2.1. Then,

(3.2) $n^{1/3} \left( \frac{\Phi(\beta_0, x_0)}{4 \lambda_0(x_0) \lambda_0'(x_0)} \right)^{1/3} \left\{ \hat\lambda_n(x_0) - \lambda_0(x_0) \right\} \xrightarrow{d} \operatorname{argmin}_{t \in \mathbb{R}} \{ W(t) + t^2 \}$,

where $W$ is standard two-sided Brownian motion originating from zero. Furthermore,

(3.3) $n^{1/3} \left\{ \tilde\lambda_n(x_0) - \hat\lambda_n(x_0) \right\} \xrightarrow{p} 0$,

so that the convergence in (3.2) also holds with $\hat\lambda_n$ replaced by $\tilde\lambda_n$.
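The limit variable $\operatorname{argmin}_{t}\{W(t) + t^2\}$ has no elementary closed form, but it is easy to approximate by simulation. The following rough sketch replaces two-sided Brownian motion by a Gaussian random walk on a grid over $[-c, c]$; the truncation level `c`, the step `dt` and the function name are assumptions made purely for illustration, and ties are resolved towards the largest time, in line with the argmin convention stated before the theorem.

```python
import math
import random


def sample_argmin(rng, c=3.0, dt=0.01):
    """One approximate draw of argmin_t {W(t) + t**2} on a grid over [-c, c]."""
    m = int(round(c / dt))
    sd = math.sqrt(dt)
    best_t, best_v = 0.0, 0.0            # value at t = 0, where W(0) = 0
    for sign in (1.0, -1.0):             # independent branches for t > 0 and t < 0
        w = 0.0
        for k in range(1, m + 1):
            w += rng.gauss(0.0, sd)      # Brownian increment over a step of length dt
            t = sign * k * dt
            v = w + t * t
            if v < best_v or (v == best_v and t > best_t):
                best_t, best_v = t, v
    return best_t
```

Repeated calls such as `[sample_argmin(random.Random(s)) for s in range(1000)]` give an empirical approximation of the limit distribution, which is symmetric around zero; the quadratic drift makes minimizers outside a moderate window very unlikely, which is why truncating at `c = 3` is used here.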

Let $\bar\lambda_n$ be the estimator considered in [2], which has been proven to be consistent. Completely
similar to the proof of Theorem 3.2, it can be shown that

$n^{1/3} \left\{ \bar\lambda_n(x_0) - \hat\lambda_n(x_0) \right\} \xrightarrow{p} 0$,

so that the convergence in (3.2) also holds with $\hat\lambda_n$ replaced by $\bar\lambda_n$.

THEOREM 3.3. Assume (A1) and (A2) and let $x_0 \in (0, \tau_H)$. Suppose that $\lambda_0$ is nonincreasing
on $[0, \infty)$ and continuously differentiable in a neighborhood of $x_0$, with $\lambda_0(x_0) \ne 0$
and $\lambda_0'(x_0) < 0$. Moreover, suppose that $H^{uc}(x)$ and $x \mapsto \Phi(\beta_0, x)$ are continuously differentiable
in a neighborhood of $x_0$, where $H^{uc}$ is defined below (2.9) and $\Phi$ is defined in (3.1).
Let $\hat\lambda_n$ and $\tilde\lambda_n$ be the estimators defined in Section 2.2. Then,

(3.4) $n^{1/3} \left| \frac{\Phi(\beta_0, x_0)}{4 \lambda_0(x_0) \lambda_0'(x_0)} \right|^{1/3} \left\{ \hat\lambda_n(x_0) - \lambda_0(x_0) \right\} \xrightarrow{d} \operatorname{argmin}_{t \in \mathbb{R}} \{ W(t) + t^2 \}$,

where $W$ is standard two-sided Brownian motion originating from zero. Furthermore,

$n^{1/3} \left\{ \tilde\lambda_n(x_0) - \hat\lambda_n(x_0) \right\} \xrightarrow{p} 0$,

so that the convergence in (3.4) also holds with $\hat\lambda_n$ replaced by $\tilde\lambda_n$.

Note that in the special case of no covariates, i.e., $Z = 0$, it follows that $\Phi(\beta_0, x_0) =
1 - H(x_0)$, so that with the above results we recover Theorems 2.2 and 2.3 in [10]. If, in
addition, one specializes to the case of no censoring, i.e., $\Phi(\beta_0, x_0) = 1 - H(x_0) = 1 - F(x_0)$,
we recover Theorems 6.1 and 7.1 in [17]. The asymptotic distribution of the baseline density
estimator is provided by the next theorem.

THEOREM 3.4. Assume (A1) and (A2) and let $x_0 \in (0, \tau_H)$. Suppose that $f_0$ is nonincreasing
on $[0, \infty)$ and continuously differentiable in a neighborhood of $x_0$, with $f_0(x_0) \ne 0$
and $f_0'(x_0) < 0$. Let $F_0$ be the baseline distribution function and suppose that $H^{uc}(x)$ and
$x \mapsto \Phi(\beta_0, x)$ are continuously differentiable in a neighborhood of $x_0$, where $H^{uc}$ is defined
below (2.9) and $\Phi$ is defined in (3.1). Let $\tilde f_n$ be the estimator defined in Section 2.3. Then,

$n^{1/3} \left| \frac{\Phi(\beta_0, x_0)}{4 f_0(x_0) f_0'(x_0) \left[ 1 - F_0(x_0) \right]} \right|^{1/3} \left\{ \tilde f_n(x_0) - f_0(x_0) \right\} \xrightarrow{d} \operatorname{argmin}_{t \in \mathbb{R}} \{ W(t) + t^2 \}$,

where $W$ is standard two-sided Brownian motion originating from zero.

In the special case of no covariates, it follows that

$\frac{\Phi(\beta_0, x_0)}{1 - F_0(x_0)} = \frac{1 - H(x_0)}{1 - F(x_0)} = 1 - G(x_0)$,

so that the above result recovers Theorem 2.1 in [10]. If, in addition, one specializes to the
case of no censoring, i.e., $G(x_0) = 0$, we recover Theorem 6.3 in [16] and the corresponding
result in [8].
4 Consistency

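The strong pointwise consistency of the proposed estimators will be proven using arguments
similar to those in [19] and [10]. First, define

(4.1) $\Phi_n(\beta, x) = \int \{u \ge x\}\, e^{\beta' z}\, d\mathbb{P}_n(u, \delta, z)$,

for $\beta \in \mathbb{R}^p$ and $x \ge 0$ and note that the Breslow estimator in (2.8) can also be represented as

(4.2) $\Lambda_n(x) = \int \frac{\delta \{u \le x\}}{\Phi_n(\hat\beta_n, u)}\, d\mathbb{P}_n(u, \delta, z)$, $x \ge 0$.

To establish consistency of the estimators, we first obtain some properties of $\Phi_n$ and $\Phi$ as
defined in (4.1) and (3.1) and their first and second partial derivatives, which by the dominated
convergence theorem and conditions (A1) and (A2) are given by

$D^{(1)}(\beta, x) = \frac{\partial \Phi(\beta, x)}{\partial \beta} = \int \{u \ge x\}\, z\, e^{\beta' z}\, dP(u, \delta, z) \in \mathbb{R}^p$,

$D_n^{(1)}(\beta, x) = \frac{\partial \Phi_n(\beta, x)}{\partial \beta} = \int \{u \ge x\}\, z\, e^{\beta' z}\, d\mathbb{P}_n(u, \delta, z) \in \mathbb{R}^p$,

$D^{(2)}(\beta, x) = \frac{\partial^2 \Phi(\beta, x)}{\partial \beta^2} = \int \{u \ge x\}\, z z'\, e^{\beta' z}\, dP(u, \delta, z) \in \mathbb{R}^p \times \mathbb{R}^p$,

$D_n^{(2)}(\beta, x) = \frac{\partial^2 \Phi_n(\beta, x)}{\partial \beta^2} = \int \{u \ge x\}\, z z'\, e^{\beta' z}\, d\mathbb{P}_n(u, \delta, z) \in \mathbb{R}^p \times \mathbb{R}^p$.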

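LEMMA 4.1. Suppose that (A2) holds for some $\varepsilon > 0$. Then, for any $0 < M < \tau_H$,

(i) $0 < \inf_{x \le M} \inf_{|\beta - \beta_0| \le \varepsilon} |\Phi(\beta, x)| \le \sup_{x \in \mathbb{R}} \sup_{|\beta - \beta_0| \le \varepsilon} |\Phi(\beta, x)| < \infty$.

(ii) For any sequence $\beta_n^*$, such that $\beta_n^* \to \beta_0$ almost surely,

$0 < \liminf_{n\to\infty} \inf_{x \le M} |\Phi_n(\beta_n^*, x)| \le \limsup_{n\to\infty} \sup_{x \in \mathbb{R}} |\Phi_n(\beta_n^*, x)| < \infty$,

with probability one.

(iii) For $i = 1, 2$,

$\sup_{x \in \mathbb{R}} \sup_{|\beta - \beta_0| \le \varepsilon} |D^{(i)}(\beta, x)| < \infty$.

(iv) For $i = 1, 2$ and for any sequence $\beta_n^*$, such that $\beta_n^* \to \beta_0$ almost surely,

$\limsup_{n\to\infty} \sup_{x \in \mathbb{R}} |D_n^{(i)}(\beta_n^*, x)| < \infty$,

with probability one.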
Proof. First, notice that for every $x \le M$ and $\beta \in \mathbb{R}^p$,

(4.3) $0 < \Phi(\beta, M) \le \Phi(\beta, x)$,

and for every $x \in \mathbb{R}$ and $|\beta - \beta_0| \le \varepsilon$,

(4.4) $\Phi(\beta, x) \le \Phi(\beta, 0) \le \sup_{|\beta - \beta_0| \le \varepsilon} E\left[ e^{\beta' Z} \right] < \infty$.

Hence, by dominated convergence, for every $x \le M$, the function $\beta \mapsto \Phi(\beta, x)$ is continuous
and therefore attains a minimum on the set $|\beta - \beta_0| \le \varepsilon$. Together with (4.3) and (4.4), this
proves (i).
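To show (ii), note that similar to (4.3) and (4.4), for every $x \in [0, M]$ and $\beta \in \mathbb{R}^p$,

(4.5) $\Phi_n(\beta, M) \le \Phi_n(\beta, x)$,

and for every $x \in \mathbb{R}$ and $\beta \in \mathbb{R}^p$,

(4.6) $\Phi_n(\beta, x) \le \Phi_n(\beta, 0)$.

Choose $\varepsilon > 0$ from (A2) and let $\delta = \varepsilon / (2\sqrt{p})$. Strong consistency of $\beta_n^*$ yields that, for $n$
sufficiently large,

$\beta_{0j} - \delta \le \beta_{nj}^* \le \beta_{0j} + \delta$, for all $j = 1, 2, \ldots, p$,

with probability one. Next, consider all subsets $I_k = \{i_1, i_2, \ldots, i_k\} \subseteq \{1, 2, \ldots, p\} = I$.
Then, for each $I_k$ fixed, on each event

$\bigcap_{j \in I_k} \{Z_{ij} \ge 0\} \cap \bigcap_{j \in I \setminus I_k} \{Z_{ij} < 0\}$, where $Z_i = (Z_{i1}, \ldots, Z_{ip})' \in \mathbb{R}^p$,

we have

$\sum_{j \in I_k} (\beta_{0j} - \delta) Z_{ij} + \sum_{j \in I \setminus I_k} (\beta_{0j} + \delta) Z_{ij} \le \beta_n^{*\prime} Z_i \le \sum_{j \in I_k} (\beta_{0j} + \delta) Z_{ij} + \sum_{j \in I \setminus I_k} (\beta_{0j} - \delta) Z_{ij}$.

Define $\alpha_k, \gamma_k \in \mathbb{R}^p$ with coordinates

$\alpha_{kj} = \begin{cases} \beta_{0j} - \delta, & j \in I_k, \\ \beta_{0j} + \delta, & j \in I \setminus I_k, \end{cases}$ and $\gamma_{kj} = \begin{cases} \beta_{0j} + \delta, & j \in I_k, \\ \beta_{0j} - \delta, & j \in I \setminus I_k. \end{cases}$

Then $|\beta_0 - \alpha_k| \le \varepsilon$ and $|\beta_0 - \gamma_k| \le \varepsilon$ and together with (4.5) and (4.6), we find that for every
$x \le M$,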



(4.7) 

and for every x G M, 
(4.8) 



Ii-CI K. i=l 
By (A2) and the law of large numbers,



Yl I n - [ ^ 2^ ^ [i^ ^ M}e<^ 



/fcC/ I 1=1 

with probability one and similarly, 



>0, 



(4.9) 



1 



E 



< sup E 

l/3-/3ol<£ 



< oo, 



with probability one. This proves (ii). 

To prove (iii), it suffices to show that the inequalities hold componentwise. For this, notice
that for the $j$th element of the vector $D^{(1)}$,

$\sup_{x \in \mathbb{R}} \sup_{|\beta - \beta_0| \le \varepsilon} \left| E\left[ \{T \ge x\}\, Z_j\, e^{\beta' Z} \right] \right| \le \sup_{|\beta - \beta_0| \le \varepsilon} E\left[ |Z|\, e^{\beta' Z} \right] < \infty$,

by (A2). Completely analogously, a similar inequality can be shown for each element of $D^{(2)}$.
Finally, to prove (iv), note that similar to (4.8) and (4.9), for the $j$th component of $D_n^{(1)}$,
we can write



sup 



1 



hci i=l 



< oo. 



with probability one, as $n$ tends to infinity. Likewise, a similar result can be obtained for
each element of $D_n^{(2)}$. □
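LEMMA 4.2. Suppose that condition (A2) holds and $\hat\beta_n \to \beta_0$ with probability one. Then,

$\sup_{x \in \mathbb{R}} |\Phi_n(\hat\beta_n, x) - \Phi(\beta_0, x)| \to 0$,

with probability one. Moreover,

(4.10) $\sqrt{n}\, \sup_{x \in \mathbb{R}} |\Phi_n(\beta_0, x) - \Phi(\beta_0, x)| = O_p(1)$.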



Proof. For all $x \in \mathbb{R}$, write

$|\Phi_n(\hat\beta_n, x) - \Phi(\beta_0, x)| \le |\Phi_n(\hat\beta_n, x) - \Phi_n(\beta_0, x)| + |\Phi_n(\beta_0, x) - \Phi(\beta_0, x)|$.

For the second term on the right hand side, consider the class of functions

$\mathcal{G} = \{ g(u, z; x) : x \in \mathbb{R} \}$,

where for each $x \in \mathbb{R}$ and $\beta_0 = (\beta_{01}, \ldots, \beta_{0p}) \in \mathbb{R}^p$ fixed,

$g(u, z; x) = \{u \ge x\}\, e^{\beta_{01} z_1} e^{\beta_{02} z_2} \cdots e^{\beta_{0p} z_p}$

is a product of an indicator and strictly positive monotone functions. It follows that $\mathcal{G}$ is
a VC-subgraph class and its envelope $G = e^{\beta_0' z}$ is square integrable under condition (A2).
Standard results from empirical process theory [22] yield that the class of functions $\mathcal{G}$ is
Glivenko-Cantelli, i.e.,

(4.11) $\sup_{x \in \mathbb{R}} |\Phi_n(\beta_0, x) - \Phi(\beta_0, x)| = \sup_{g \in \mathcal{G}} \left| \int g(u, z; x)\, d(\mathbb{P}_n - P)(u, \delta, z) \right| \to 0$,

with probability one. Moreover, $\mathcal{G}$ is a Donsker class, i.e.,

$\sqrt{n} \int g(u, z; x)\, d(\mathbb{P}_n - P)(u, \delta, z) = O_p(1)$,

so that (4.10) follows by the continuous mapping theorem. Finally, by Taylor expansion and the
Cauchy-Schwarz inequality, it follows that

$\sup_{x \in \mathbb{R}} |\Phi_n(\hat\beta_n, x) - \Phi_n(\beta_0, x)| = \sup_{x \in \mathbb{R}} \left| (\hat\beta_n - \beta_0)' D_n^{(1)}(\beta^*, x) \right| \le |\hat\beta_n - \beta_0| \sup_{x \in \mathbb{R}} |D_n^{(1)}(\beta^*, x)|$,

for some $\beta^*$, for which $|\beta^* - \beta_0| \le |\hat\beta_n - \beta_0|$. Together with (4.11), from the strong consistency
of $\hat\beta_n$ (e.g., see Theorem 3.1 in [21]) and Lemma 4.1, the lemma follows. □
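The uniform convergence in (4.11) can be observed numerically in the simplest special case. The sketch below assumes no covariates ($Z = 0$) and no censoring, with $T$ standard exponential, so that $\Phi(\beta_0, x) = P(T \ge x) = e^{-x}$ for $x \ge 0$ and $\Phi_n(\beta_0, x)$ is the fraction of observations that are at least $x$. These distributional choices and the function name are illustrative assumptions and are not taken from the text.

```python
import math
import random


def sup_distance(n, seed=0):
    """sup_x |Phi_n(x) - Phi(x)| for T ~ Exp(1), Z = 0 and no censoring."""
    rng = random.Random(seed)
    times = sorted(rng.expovariate(1.0) for _ in range(n))
    worst = 0.0
    for k, t in enumerate(times):
        phi = math.exp(-t)                       # Phi(t) = P(T >= t)
        at_t = (n - k) / n                       # Phi_n at x = t (observation t included)
        after_t = (n - k - 1) / n                # limit of Phi_n as x decreases to t
        worst = max(worst, abs(at_t - phi), abs(after_t - phi))
    return worst
```

Because $\Phi_n$ is a step function and $\Phi$ is continuous and monotone, the supremum is attained at or just after an observation, which is why only these two values per observation are inspected. Evaluating `sup_distance(n)` for increasing `n` shows the distance shrinking roughly at the rate $n^{-1/2}$ suggested by (4.10).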



The theorem hereafter furnishes the strong uniform consistency and the uniform convergence
rate of the Breslow estimator in (4.2). Strong uniform consistency of $\Lambda_n$ and process
convergence of $\sqrt{n}(\Lambda_n - \Lambda_0)$ have been established in [13], under the stronger assumption of
bounded covariates. Weak consistency has been derived or mentioned before, see for example [H].



THEOREM 4.1. Under the assumptions (A1) and (A2), for all $0 < M < \tau_H$,

$\sup_{x \in [0, M]} |\Lambda_n(x) - \Lambda_0(x)| \to 0$,

with probability one and $\sqrt{n}\, \sup_{x \in [0, M]} |\Lambda_n(x) - \Lambda_0(x)| = O_p(1)$.



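Proof. From the expression for the baseline cumulative hazard function in (2.10) together
with (3.1) and (4.2), it follows that

$\sup_{x \in [0, M]} |\Lambda_n(x) - \Lambda_0(x)| \le A_n + B_n + C_n$,

where

$A_n = \sup_{x \in [0, M]} \left| \int \delta \{u \le x\} \left( \frac{1}{\Phi_n(\hat\beta_n, u)} - \frac{1}{\Phi_n(\beta_0, u)} \right) d\mathbb{P}_n(u, \delta, z) \right|$,

$B_n = \sup_{x \in [0, M]} \left| \int \delta \{u \le x\} \left( \frac{1}{\Phi_n(\beta_0, u)} - \frac{1}{\Phi(\beta_0, u)} \right) d\mathbb{P}_n(u, \delta, z) \right|$,

$C_n = \sup_{x \in [0, M]} \left| \int \frac{\delta \{u \le x\}}{\Phi(\beta_0, u)}\, d(\mathbb{P}_n - P)(u, \delta, z) \right|$.

Starting with the first term on the right hand side, note that

(4.12) $A_n \le \frac{|\hat\beta_n - \beta_0|}{\Phi_n(\hat\beta_n, M)\, \Phi_n(\beta_0, M)} \sup_{x \in \mathbb{R}} |D_n^{(1)}(\beta^*, x)|$,

for some $|\beta^* - \beta_0| \le |\hat\beta_n - \beta_0|$. According to Lemma 4.1, the right hand side is bounded by
$C|\hat\beta_n - \beta_0|$, for some $C > 0$. Since $\hat\beta_n$ is strongly consistent and $|\hat\beta_n - \beta_0| = O_p(n^{-1/2})$ (e.g.,
see Theorems 3.1 and 3.2 in [21]), it follows that $A_n \to 0$ almost surely and $A_n = O_p(n^{-1/2})$.
Similarly,

(4.13) $B_n \le \frac{1}{\Phi_n(\beta_0, M)\, \Phi(\beta_0, M)} \sup_{x \in \mathbb{R}} |\Phi_n(\beta_0, x) - \Phi(\beta_0, x)|$.

From Lemmas 4.1 and 4.2, it follows that $B_n \to 0$ almost surely and $B_n = O_p(n^{-1/2})$. For
the last term $C_n$, consider the class of functions $\mathcal{H} = \{h(u, \delta; x) : x \in [0, M]\}$, where for each
$x \in [0, M]$, with $M < \tau_H$ and $\beta_0 \in \mathbb{R}^p$ fixed,

$h(u, \delta; x) = \frac{\delta \{u \le x\}}{\Phi(\beta_0, u)}$.

Note that the function $h$ is a product of indicators and a uniformly bounded monotone
function. Similar to the arguments given in the proof of Lemma 4.2, it follows that the
class $\mathcal{H}$ is Glivenko-Cantelli, i.e.,

$\sup_{h \in \mathcal{H}} \left| \int h(u, \delta; x)\, d(\mathbb{P}_n - P)(u, \delta, z) \right| \to 0$,

almost surely, which gives the first statement of the theorem. Moreover, $\mathcal{H}$ is a Donsker class
and hence the second statement of the theorem follows by the continuous mapping theorem. This
completes the proof. □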

Strong consistency of $F_n$ follows from the strong consistency of the Breslow estimator, as
stated in the next corollary.

COROLLARY 4.1. Under the assumptions (A1) and (A2) and for all $0 < M < \tau_H$,

$\sup_{x \in [0, M]} |F_n(x) - F_0(x)| \to 0$,

with probability one.

Proof. The proof is straightforward and follows immediately from Theorem 4.1, relations (2.12)
and (2.13), together with the fact that $|e^{-y} - 1| \le 2|y|$, as $y \to 0$. □



To prepare the proof of consistency for the nondecreasing NPMLE $\hat\lambda_n$, we first
establish the following lemma, which is completely similar to Lemma 4.3 in [10].

LEMMA 4.3. Assume that $\Lambda_0$ is convex on $[0, \tau_H]$ and that conditions (A1) and (A2) hold.
Let $\hat\beta_n$ be the maximum partial likelihood estimator and define

(4.14) $\hat W_n(x) = W_n(\hat\beta_n, x) - W_n(\hat\beta_n, T_{(1)})$,

where $W_n$ is defined in (2.4). Let $(\hat W_n(x), \hat V_n(x))$ be the GCM of $(\hat W_n(x), V_n(x))$, for
$x \in [T_{(1)}, T_{(n)}]$, where $V_n$ is defined in (2.5). Then

(4.15) $\sup_{x \in [T_{(1)}, T_{(n)}]} \left| \hat V_n(x) - V(x) \right| \to 0$,

with probability one, where $V(x) = H^{uc}(x)$, as defined just below (2.9).
Proof. By Glivenko-Cantelli,

(4.16) $\sup_{x} |V_n(x) - V(x)| \to 0$,

almost surely, because of the continuity of $V$. Furthermore, note that

(4.17) $\int_0^{T_{(1)}} \Phi_n(\hat\beta_n, s)\, ds = T_{(1)}\, \Phi_n(\hat\beta_n, T_{(1)}) \to 0$,

almost surely, since $\Phi_n(\hat\beta_n, s)$ is bounded uniformly according to Lemma 4.1 and $T_{(1)} \to 0$
with probability one, by the Borel-Cantelli lemma. Moreover, if we define



(4.18) 

then we can write 
(4.19) 



Woix) = Wi(3o,x) 



where $\Phi$ is defined in (3.1). It follows that

< 



(4.20) 



sup 

a;6[r(i),T{„)] 



Wn{x) - Wo{x) 



sup 



«>„(/3„,s)-$(/3o,s) ds 



< th sup 



^n{Pn,x) - ^>(/3o,x) 



0, 



with probability one, by Lemma 4.2.



Take $\hat W_n^{-1}$ to be the inverse of $\hat W_n$, which is well defined on $[0, \hat W_n(T_{(n)})]$, since $\hat W_n$ is
strictly monotone on $[T_{(1)},T_{(n)}]$. We first extend $\hat W_n$ to $[T_{(1)},\infty)$ and $\hat W_n^{-1}$ to $[0,\infty)$. Define
$\hat W_n(t) = \hat W_n(T_{(n)}) + (t - T_{(n)})$, for all $t > T_{(n)}$, so that $\hat W_n^{-1}(y) = T_{(n)} + (y - \hat W_n(T_{(n)}))$, for
$y > \hat W_n(T_{(n)})$. Similarly, take $W_0^{-1}$ to be the inverse of $W_0$, which is well defined since $W_0$
is strictly monotone on $[0,\tau_H]$, and extend $W_0$ and $W_0^{-1}$ to $[0,\infty)$ by defining $W_0(t) =
W_0(\tau_H) + (t - \tau_H)$, for all $t > \tau_H$, so that $W_0^{-1}(y) = \tau_H + (y - W_0(\tau_H))$, for $y > W_0(\tau_H)$. It
follows that the extension $W_0^{-1}(y)$ is uniformly continuous on $[0,\infty)$. Immediate derivations
give that



(4.21)

$$\sup_{0\le y\le \hat W_n(T_{(n)})} |\hat W_n^{-1}(y) - W_0^{-1}(y)| \to 0,$$



with probability one. Furthermore, it can be inferred that 



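$$\begin{aligned}
\delta_n &= \sup_{y\in[0,\hat W_n(T_{(n)})]} |V_n\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y)| \\
&\le \sup_{y\in[0,\hat W_n(T_{(n)})]} |(V_n - V)\circ\hat W_n^{-1}(y)| + \sup_{y\in[0,\hat W_n(T_{(n)})]} |V\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y)| \\
&\le \sup_{t} |V_n(t) - V(t)| + \sup_{y\in[0,\hat W_n(T_{(n)})]} |V(\hat W_n^{-1}(y)) - V(W_0^{-1}(y))| \to 0,
\end{aligned}$$

almost surely, by (4.16), (4.21) and the continuity of $V$. Notice that according to (2.9)
and (4.19), $\lambda_0$ can also be represented as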



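(4.22)

$$\lambda_0(x) = \frac{dV(x)/dx}{dW_0(x)/dx},$$

which is well defined for $x \in [0,\tau_H)$, since $\Phi$ is bounded away from zero, by Lemma 4.1.
Taking $x = W_0^{-1}(y)$ gives that

$$\lambda_0(W_0^{-1}(y)) = \frac{d\,(V\circ W_0^{-1})(y)}{dy}.$$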

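Therefore, convexity of $\Lambda_0$ implies convexity of $V\circ W_0^{-1}$ and subsequently of $V\circ W_0^{-1} - \delta_n$.
Moreover, from the definition of $\delta_n$, it follows that for every $y \in [0, \hat W_n(T_{(n)})]$,

$$V\circ W_0^{-1}(y) - \delta_n \le V_n\circ\hat W_n^{-1}(y).$$

As $\hat V_n\circ\hat W_n^{-1}(y)$ is the greatest convex function below $V_n\circ\hat W_n^{-1}(y)$, we must have

$$V\circ W_0^{-1}(y) - \delta_n \le \hat V_n\circ\hat W_n^{-1}(y) \le V_n\circ\hat W_n^{-1}(y),$$

for each $y \in [0, \hat W_n(T_{(n)})]$. Rewriting the above inequalities leads to

$$-\delta_n \le \hat V_n\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y) \le V_n\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y) \le \delta_n.$$

Taking the supremum over $[0, \hat W_n(T_{(n)})]$ then yields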



(4.23)

$$\sup_{y\in[0,\hat W_n(T_{(n)})]} |\hat V_n\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y)| \to 0,$$



with probability one. From (4.21), (4.23) and the continuity of $V$, we conclude that

$$\begin{aligned}
\sup_{t\in[T_{(1)},T_{(n)}]} |\hat V_n(t) - V(t)| &= \sup_{y\in[0,\hat W_n(T_{(n)})]} |(\hat V_n - V)\circ\hat W_n^{-1}(y)| \\
&\le \sup_{y\in[0,\hat W_n(T_{(n)})]} |\hat V_n\circ\hat W_n^{-1}(y) - V\circ W_0^{-1}(y)| + \sup_{y\in[0,\hat W_n(T_{(n)})]} |V\circ W_0^{-1}(y) - V\circ\hat W_n^{-1}(y)| \to 0,
\end{aligned}$$

with probability one. □

In the nonincreasing case, similar to (4.16) one can show

(4.24)

$$\sup_{x\in[0,T_{(n)}]} |\hat Y_n(x) - V(x)| \to 0,$$

almost surely, where $(W_n(\hat\beta_n,x), \hat Y_n(x))$ is the LCM of $(W_n(\hat\beta_n,x), Y_n(x))$, with $Y_n$ defined
in (2.11). We are now in the position to prove Theorem 3.1, which establishes strong consistency
of the estimators.

Proof of Theorem 3.1. First consider the second statement of case (i). Since $\tilde\Lambda_n$ is convex
on the open interval $(0,\tau_H)$, it admits in every point $x_0 \in (0,\tau_H)$ a finite left and a right
derivative, denoted by $\tilde\Lambda_n^-$ and $\tilde\Lambda_n^+$ respectively. Moreover, for any fixed $x_0 \in (0,\tau_H)$ and for
sufficiently small $\delta > 0$, it follows that

$$\frac{\tilde\Lambda_n(x_0) - \tilde\Lambda_n(x_0-\delta)}{\delta} \le \tilde\Lambda_n^-(x_0) \le \tilde\Lambda_n^+(x_0) \le \frac{\tilde\Lambda_n(x_0+\delta) - \tilde\Lambda_n(x_0)}{\delta}.$$

When $n \to \infty$, then for any $0 < M < \tau_H$,

(4.25)

$$\sup_{x\in[0,M]} |\tilde\Lambda_n(x) - \Lambda_0(x)| \le \sup_{x\in[0,M]} |\Lambda_n(x) - \Lambda_0(x)|.$$

This is a variation of Marshall's lemma and can be proven similarly to (7.2.3) in [19] or
Lemma 4.1 in [10]. By convexity of $\Lambda_0$ and the fact that $\tilde\Lambda_n$ is the greatest convex function
below $\Lambda_n$, one must have

$$\Lambda_0(x) - \delta_n \le \tilde\Lambda_n(x) \le \Lambda_n(x),$$

where $\delta_n = \sup_{x\in[0,M]} |\Lambda_0(x) - \Lambda_n(x)|$, which yields inequality (4.25). From (4.25) and
Theorem 4.1, by first letting $n \to \infty$ and then $\delta \to 0$, we find

$$\lambda_0(x_0-) \le \liminf_{n\to\infty} \tilde\Lambda_n^-(x_0) \le \limsup_{n\to\infty} \tilde\Lambda_n^-(x_0) \le \limsup_{n\to\infty} \tilde\Lambda_n^+(x_0) \le \lambda_0(x_0+).$$

Because $\tilde\lambda_n(x_0) = \tilde\Lambda_n^-(x_0)$, this proves that $\tilde\lambda_n$ is a strongly consistent estimator.
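
The Marshall-type inequality (4.25) can be checked numerically on a grid. The following is a minimal sketch, not the estimator of the paper: the convex function $x^2$ stands in for $\Lambda_0$, a noisy perturbation of it stands in for $\Lambda_n$, and the helper names `gcm_values` and `marshall_check` are hypothetical.

```python
import numpy as np

def gcm_values(x, y):
    """Values at x of the greatest convex minorant of the points (x[i], y[i])."""
    hull = [0]  # indices of the vertices of the lower convex hull
    for i in range(1, len(x)):
        # Drop the last vertex while the slope into it is not smaller than
        # the slope from it to the new point (convexity would be violated).
        while len(hull) >= 2:
            j, k = hull[-2], hull[-1]
            if (y[k] - y[j]) * (x[i] - x[k]) >= (y[i] - y[k]) * (x[k] - x[j]):
                hull.pop()
            else:
                break
        hull.append(i)
    return np.interp(x, x[hull], y[hull])

def marshall_check(n_grid=200, noise=0.05, seed=0):
    """Return (sup |GCM - Lambda_0|, sup |Lambda_n - Lambda_0|) on a grid."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0.0, 1.0, n_grid)
    lam0 = x ** 2                                       # assumed convex Lambda_0
    lam_n = lam0 + noise * rng.standard_normal(n_grid)  # assumed noisy Lambda_n
    lam_gcm = gcm_values(x, lam_n)
    return np.max(np.abs(lam_gcm - lam0)), np.max(np.abs(lam_n - lam0))
```

Under these assumptions the first returned value should never exceed the second, which mirrors the sandwich argument $\Lambda_0 - \delta_n \le \tilde\Lambda_n \le \Lambda_n$ used above.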

For $\hat\lambda_n$, note that since $\hat V_n$ is convex on the open interval $(0,\tau_H)$, it admits in every point
$x_0 \in (0,\tau_H)$ a finite left and a right derivative, denoted by $\hat V_n^-$ and $\hat V_n^+$ respectively, where

$$\hat V_n^-(x) = \lim_{\delta\downarrow 0} \frac{\hat V_n(x) - \hat V_n(x-\delta)}{\hat W_n(x) - \hat W_n(x-\delta)}, \qquad
\hat V_n^+(x) = \lim_{\delta\downarrow 0} \frac{\hat V_n(x+\delta) - \hat V_n(x)}{\hat W_n(x+\delta) - \hat W_n(x)}.$$

For any fixed $x_0 \in (0,\tau_H)$ and for sufficiently small $\delta > 0$, it follows that

$$\frac{\hat V_n(x_0) - \hat V_n(x_0-\delta)}{\hat W_n(x_0) - \hat W_n(x_0-\delta)} \le \hat V_n^-(x_0) \le \hat V_n^+(x_0) \le \frac{\hat V_n(x_0+\delta) - \hat V_n(x_0)}{\hat W_n(x_0+\delta) - \hat W_n(x_0)}.$$

By making use of Lemma 4.3 together with (4.20) and letting $n \to \infty$,

$$\frac{V(x_0) - V(x_0-\delta)}{W_0(x_0) - W_0(x_0-\delta)} \le \liminf_{n\to\infty} \hat V_n^-(x_0) \le \limsup_{n\to\infty} \hat V_n^+(x_0) \le \frac{V(x_0+\delta) - V(x_0)}{W_0(x_0+\delta) - W_0(x_0)}.$$

Furthermore, by letting $\delta \to 0$, together with (4.22) we get

$$\lambda_0(x_0-) \le \liminf_{n\to\infty} \hat V_n^-(x_0) \le \limsup_{n\to\infty} \hat V_n^-(x_0) \le \limsup_{n\to\infty} \hat V_n^+(x_0) \le \lambda_0(x_0+),$$

which completes the proof of (i), since $\hat\lambda_n(x_0) = \hat V_n^-(x_0)$. The proofs of (ii) and (iii) are
completely analogous, using (4.24) and Corollary 4.1. □

5 Inverse processes

To obtain the limit distribution of the estimators, we follow the approach proposed in [8].
In order to keep the exposition brief, we do not treat all five separate cases in detail, but we
confine ourselves to the most important ones, as the other cases can be handled similarly. In
the case of a nondecreasing $\lambda_0$, the distribution of the NPMLE $\hat\lambda_n$ can be obtained through
the study of the inverse process



(5.1)

$$\hat U_n^\lambda(a) = \operatorname*{argmin}_{x\in[T_{(1)},T_{(n)}]} \{V_n(x) - a\hat W_n(x)\},$$

for $a > 0$, where $V_n$ and $\hat W_n$ have been defined in (2.5) and (4.14). Subsequently, for a
given $a > 0$, the switching relationship holds, i.e., $\hat U_n^\lambda(a) \ge x$ if and only if $\hat\lambda_n(x) \le a$ with
probability one, so that after scaling, it follows that

(5.2)

$$n^{1/3}\{\hat\lambda_n(x_0) - \lambda_0(x_0)\} > a \iff n^{1/3}\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\} < 0,$$

for $0 < x_0 < \tau_H$, with probability one. A similar relationship holds for $\tilde\lambda_n$ and the
corresponding inverse process



(5.3)

$$\tilde U_n^\lambda(a) = \operatorname*{argmin}_{x\in[0,T_{(n)}]} \{\Lambda_n(x) - ax\}.$$
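
The switching relation between the left-hand slopes of a greatest convex minorant and the argmin-type inverse process can be illustrated on a finite set of points. The sketch below is only an illustration under assumptions: it works with arbitrary points $(x_i, y_i)$ rather than with the Breslow estimator, it computes the slopes by pooling adjacent violators, and the function names are hypothetical.

```python
import numpy as np

def gcm_left_slopes(x, y):
    """Left-hand slopes of the GCM of the points (x[i], y[i]) at x[1], ..., x[-1].

    Pool-adjacent-violators on the raw slopes dy/dx with weights dx.
    """
    dx, dy = np.diff(x), np.diff(y)
    blocks = []  # each block: [weighted mean slope, total weight, number of cells]
    for s, w in zip(dy / dx, dx):
        blocks.append([s, w, 1])
        # Pool while the block slopes fail to be strictly increasing.
        while len(blocks) >= 2 and blocks[-2][0] >= blocks[-1][0]:
            s2, w2, c2 = blocks.pop()
            s1, w1, c1 = blocks.pop()
            blocks.append([(s1 * w1 + s2 * w2) / (w1 + w2), w1 + w2, c1 + c2])
    return np.concatenate([np.full(c, s) for s, _, c in blocks])

def inverse_process(x, y, a):
    """Largest x[i] that minimizes y[i] - a * x[i]."""
    vals = y - a * x
    return x[len(x) - 1 - np.argmin(vals[::-1])]
```

For a level `a` that does not coincide with one of the slopes, one expects `inverse_process(x, y, a) >= x[i]` exactly when the left-hand slope at `x[i]` is at most `a`, which is the discrete analogue of the relation stated before (5.2).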



For the nonincreasing density estimator $\tilde f_n$, we consider the inverse process

(5.4)

$$\tilde U_n^f(a) = \operatorname*{argmax}_{x\in[0,T_{(n)}]} \{F_n(x) - ax\},$$

where argmax denotes the largest location of the maximum. In this case, instead of (5.2), we
have

(5.5)

$$n^{1/3}\{\tilde f_n(x_0) - f_0(x_0)\} > a \iff n^{1/3}\{\tilde U_n^f(f_0(x_0) + n^{-1/3}a) - x_0\} > 0.$$

Similarly, in the case of estimating a nonincreasing $\lambda_0$, we consider inverse processes $\hat U_n^\lambda$
and $\tilde U_n^\lambda$ defined with argmax instead of argmin in (5.1) and (5.3) and we have switching
relations similar to (5.5).



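From the definition of the inverse process in (5.3) and given that the argmin is invariant
under addition of and multiplication with positive constants, it can be derived that

(5.6)

$$n^{1/3}\{\tilde U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\} = \operatorname*{argmin}_{x\in I_n(x_0)} \{\tilde Z_n^\lambda(x) - ax\},$$

where $I_n(x_0) = [-n^{1/3}x_0,\, n^{1/3}(T_{(n)} - x_0)]$ and

(5.7)

$$\tilde Z_n^\lambda(x) = n^{2/3}\{\Lambda_n(x_0 + n^{-1/3}x) - \Lambda_n(x_0) - n^{-1/3}\lambda_0(x_0)x\}.$$

Likewise, $n^{1/3}\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\}$ is equal to

(5.8)

$$\operatorname*{argmin}_{x\in\hat I_n(x_0)} \Big\{ \hat Z_n^\lambda(x) - \frac{n^{1/3}a}{\Phi(\beta_0,x_0)} \big[\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big] \Big\},$$

where $\hat I_n(x_0) = [-n^{1/3}(x_0 - T_{(1)}),\, n^{1/3}(T_{(n)} - x_0)]$ and

(5.9)

$$\hat Z_n^\lambda(x) = \frac{n^{2/3}}{\Phi(\beta_0,x_0)} \Big\{ V_n(x_0 + n^{-1/3}x) - V_n(x_0) - \lambda_0(x_0)\big[\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big] \Big\},$$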



and similarly

(5.10)

$$n^{1/3}\{\tilde U_n^f(f_0(x_0) + n^{-1/3}a) - x_0\} = \operatorname*{argmax}_{x\in I_n(x_0)} \{\tilde Z_n^f(x) - ax\},$$

where

(5.11)

$$\tilde Z_n^f(x) = n^{2/3}\{F_n(x_0 + n^{-1/3}x) - F_n(x_0) - n^{-1/3}f_0(x_0)x\}.$$
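
The rescaling behind (5.6) and (5.7) is purely algebraic, so it can be verified on any finite grid. The sketch below assumes an arbitrary nondecreasing sequence in place of $\Lambda_n$ and a hypothetical value `lam0` in place of $\lambda_0(x_0)$; the function names are illustrative only.

```python
import numpy as np

def largest_argmin(vals):
    """Index of the last position at which vals attains its minimum."""
    return len(vals) - 1 - int(np.argmin(vals[::-1]))

def inverse_process(x, lam, c):
    """Largest grid point minimizing lam(x) - c * x, as in (5.3)."""
    return x[largest_argmin(lam - c * x)]

def localized_argmin(x, lam, i0, lam0, a, n):
    """Largest minimizer of Z(t) - a * t on the grid t = n^(1/3) (x - x0)."""
    t = n ** (1 / 3) * (x - x[i0])
    # Discrete version of (5.7), evaluated at the rescaled grid points.
    z = n ** (2 / 3) * (lam - lam[i0] - n ** (-1 / 3) * lam0 * t)
    return t[largest_argmin(z - a * t)]
```

Since $\tilde Z_n^\lambda(t) - at$ equals $n^{2/3}\{\Lambda_n(x) - (\lambda_0(x_0) + n^{-1/3}a)x\}$ plus a constant when $t = n^{1/3}(x - x_0)$, both routes should return the same location after rescaling.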



In the case of estimating a nonincreasing $\lambda_0$, we consider the argmax of the processes (5.9)
and (5.7). Before investigating the asymptotic behavior of the above processes, we first need
to establish the following technical lemma.

LEMMA 5.1. Assume (A1) and (A2). Let $x_0 \in (0,\tau_H)$ be fixed and suppose that

(5.12) $H^{uc}$ is continuously differentiable in a neighborhood of $x_0$.

Then, for any $k = 1,2,\ldots$,

1 



sup 

\x\<k 



6 ({u <xo + n-i/^x} -{u< xo}) { -^-7^ 



,u) ^{Po,u) 

is of the order Op{n^'^^^ logn). 

Proof. Take $0 \le x \le k$ and consider the class of functions

(5.13) $\mathcal{F}_n = \{f_n(u,\delta,z;x) : 0 \le x \le k\}$,

where for each $0 \le x \le k$,

1/3. ^ ^ 1 1 



d{Fn-P){u,6,z) 



fn{u, 5, z; x) = 5{xo < u < xq + n ' x] 



_$„(/3o,n) ^{Po,u) 
Correspondingly, consider the class Gn,k,a consisting of functions 

g{u,6,z;y,'i/) = 6{xo <u<xo + y} ' 



where $0 \le y \le n^{-1/3}k$ and $\Psi$ is nonincreasing left continuous, such that

$$\Psi(x_0 + n^{-1/3}k) \ge K \quad\text{and}\quad \sup_{u\in[0,M]} |\Psi(u) - \Phi(\beta_0,u)| \le \alpha,$$

where $K = \Phi(\beta_0, (x_0 + \tau_H)/2)/2$. Then, for any $\alpha > 0$ and $k = 1, 2, \ldots$,

$$P(\mathcal{F}_n \subset \mathcal{G}_{n,k,\alpha}) \to 1,$$

by Lemma 4.2. Furthermore, the class $\mathcal{G}_{n,k,\alpha}$ has envelope



G{u, 6, z) = 5{xq <u<XQ + n ^^'^k] , 



for which it follows from (5.12), that 

Giu,6,zfdPiu,6,z) = 



a 



P{xo <T <xo + n-^/'-^k, A = 1) = ©(a^/cn^^ 



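Since the functions in $\mathcal{G}_{n,k,\alpha}$ are sums and products of bounded, monotone functions, its
entropy with bracketing satisfies

$$\log N_{[\,]}(\varepsilon, \mathcal{G}_{n,k,\alpha}, L_2(P)) \lesssim \frac{1}{\varepsilon},$$

see e.g., Theorem 2.7.5 in [22] and Lemma 9.25 in [13], and hence, for any $\delta > 0$, the bracketing
integral

$$J_{[\,]}(\delta, \mathcal{G}_{n,k,\alpha}, L_2(P)) = \int_0^\delta \sqrt{1 + \log N_{[\,]}(\varepsilon\|G\|_2, \mathcal{G}_{n,k,\alpha}, L_2(P))}\,d\varepsilon < \infty.$$

By Theorem 2.14.2 in [22], we have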



E 



n / g{u,5,z;y,^)d(Pn- P){u,6,z) 



< J[]{l,0n,k,o^,L2{PmG\\p,2 

= 0{ak'/^n-^/'^), 



where || • \\jr denotes the supremum over the class of functions J-. Now, according to (4.10) 

(logn)~^Vnsup \^M,x) - ^{Po,x)\ 0, 

in probability. Therefore, if we choose a = logn, this gives 



E 



j giu, 6, z; y, ^')d(F„ - P){u, 6, z] 



0{k^/^n-'^/^ logn) 



and hence by the Markov inequality, this proves the lemma for the case $0 \le x \le k$. The
argument for $-k \le x \le 0$ is completely similar. □

Our approach in deriving the asymptotic distribution of the monotone estimators involves
application of results from [12]. To this end, we first determine the limiting processes of (5.9),
(5.7) and (5.11).



LEMMA 5.2. Suppose that (A1) and (A2) hold. Assume (5.12) and that

(5.14) $\lambda_0$ is continuously differentiable in a neighborhood of $x_0$.

Moreover, assume that

(5.15) $x \mapsto \Phi(\beta_0,x)$ is continuously differentiable in a neighborhood of $x_0$.

Then, for any $k = 1,2,\ldots$,

$$\sup_{|x|\le k} |\hat Z_n^\lambda(x) - \tilde Z_n^\lambda(x)| \to 0,$$

in probability, where the processes $\tilde Z_n^\lambda$ and $\hat Z_n^\lambda$ are defined in (5.7) and (5.9), respectively.

Proof. We will prove that for any $k = 1,2,\ldots$,

$$\sup_{0\le x\le k} |\hat Z_n^\lambda(x) - \tilde Z_n^\lambda(x)| \to 0,$$

in probability, since the result for $-k \le x \le 0$ follows completely analogously. Write

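$$\begin{aligned}
\Phi(\beta_0,x_0)\{\tilde Z_n^\lambda(x) - \hat Z_n^\lambda(x)\}
&= n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi_n(\hat\beta_n,u)} - 1\Big)\,d\mathbb{P}_n(u,\delta,z) \\
&\quad - n^{2/3}\lambda_0(x_0)\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi(\beta_0,x_0) - \Phi_n(\hat\beta_n,s)\}\,ds \\
&= n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi_n(\hat\beta_n,u)} - \frac{\Phi(\beta_0,x_0)}{\Phi_n(\beta_0,u)}\Big)\,d\mathbb{P}_n(u,\delta,z) \\
&\quad + n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi_n(\beta_0,u)} - 1\Big)\,d\mathbb{P}_n(u,\delta,z) \\
&\quad - n^{2/3}\lambda_0(x_0)\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi(\beta_0,x_0) - \Phi_n(\beta_0,s)\}\,ds \\
&\quad - n^{2/3}\lambda_0(x_0)\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi_n(\beta_0,s) - \Phi_n(\hat\beta_n,s)\}\,ds \\
&= A_{n1}(x) + A_{n2}(x) + A_{n3}(x) + A_{n4}(x).
\end{aligned}$$

We will show that the suprema of all four terms on the right hand side tend to zero in
probability. Similar to (4.12), according to Lemma 4.1,

$$|A_{n1}(x)| \le C|\hat\beta_n - \beta_0|\, n^{2/3} \int \{x_0 < u \le x_0 + n^{-1/3}x\}\,d\mathbb{P}_n(u,\delta,z),$$

for some $C > 0$. Since $|\hat\beta_n - \beta_0| = O_p(n^{-1/2})$ and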



/ \xq < u < xq + n "^/^xj d(Fn - P){u, 5, z) = Op(n~2/3^i/2^ ^ q^^- 
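it follows that

(5.16)

$$|A_{n1}(x)| = O_p(n^{-1/2}x^{1/2}) + O_p(n^{-1/6}x),$$

and likewise, $|A_{n4}(x)| = O_p(n^{-1/6}x)$. Furthermore, write

$$\begin{aligned}
A_{n2}(x) &= n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi_n(\beta_0,u)} - \frac{\Phi(\beta_0,x_0)}{\Phi(\beta_0,u)}\Big)\,d(\mathbb{P}_n - P)(u,\delta,z) \\
&\quad + n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi(\beta_0,u)} - 1\Big)\,d(\mathbb{P}_n - P)(u,\delta,z) \\
&\quad + n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi_n(\beta_0,u)} - \frac{\Phi(\beta_0,x_0)}{\Phi(\beta_0,u)}\Big)\,dP(u,\delta,z) \\
&\quad + n^{2/3}\int \delta\{x_0 < u \le x_0 + n^{-1/3}x\}\Big(\frac{\Phi(\beta_0,x_0)}{\Phi(\beta_0,u)} - 1\Big)\,dP(u,\delta,z) \\
&= B_{n1}(x) + B_{n2}(x) + B_{n3}(x) + B_{n4}(x).
\end{aligned}$$

According to Lemma 5.1,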



(5.17) sup \Bni{x)\=Op{n-^/^logn). 

0<x<k 



For the term Bn2, consider the class J-' consisting of functions 

(Hi 



f{u, 6, z; x) = 5{xo < n < xq + n~^/^x} ( ^^^^"'"""^ - 1 



where < x < A;, with envelope 

F{u) = S{xo <u<xo + n-^/^k} ( — — 
Then, the L2{P) norm of the envelope satisfies 



^(/3o,2;o) 



xo + n ^/^k) 



HPo,xo) 



$(/3o,xo + n-V3A;) 



Ij [H''\xo + n-^l^k)-H^\xQ)^=0{n~^), 



according to (5.12) and Lemma 4.1, so that by arguments similar as in the proof of Lemma 5.1 
(5.18) 



sup \Bn2{x)\=0p{n-^''^). 

0<x<k 



For the term B^s, similar to the treatment of the right hand side of (4.13), it follows that 
(5.19) \Bn3{x)\ < n2/30p(n-i/2) /7"'=(xo + n-^/^x) - /7"^(xo) = Opin-^^^x), 



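by condition (5.12). Next, we combine $B_{n4}(x)$ with $A_{n3}(x)$. First write

$$\begin{aligned}
A_{n3}(x) &= n^{2/3}\lambda_0(x_0)\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi_n(\beta_0,s) - \Phi(\beta_0,s)\}\,ds
+ n^{2/3}\lambda_0(x_0)\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi(\beta_0,s) - \Phi(\beta_0,x_0)\}\,ds \\
&= C_{n1}(x) + C_{n2}(x).
\end{aligned}$$

As for $C_{n1}$,

(5.20)

$$|C_{n1}(x)| \le n^{1/3}x\,\lambda_0(x_0) \sup_{x} |\Phi_n(\beta_0,x) - \Phi(\beta_0,x)| = O_p(n^{-1/6}x),$$

according to Lemma 4.2. Finally, using (2.9) and (3.1),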



pxo+n ^I'^x 

Bn4ix) + Cn2ix) = n^/^ / {$(/3o, xq) - '^iPo,u)} Xo{u) du 

J XO 



(5.21) 



n2/3Ao(xo) 



{<J>(/3o,s)-<l>(/3o,xo)} ds 



XO 



fxo+n 

n2/3 / {$(/?o, s) - mSo, xo)} {Xo{s) - Ao(xo)} ds 

J XO 



Op(n-i/3a,), 






by conditions (5.15) and (5.14). We conclude that

(5.22)

$$\Phi(\beta_0,x_0)\,|\hat Z_n^\lambda(x) - \tilde Z_n^\lambda(x)| = O_p(n^{-1/2}x^{1/2}) + O_p(n^{-1/6}x) + O_p(n^{-1/3}),$$

and after taking the supremum over $[0,k]$, the lemma follows. □



To find the limit process of $\hat Z_n^\lambda$, we will apply results from [12]. The limit distribution
for $\tilde Z_n^\lambda$ will then follow directly from Lemma 5.2. Let $B_{loc}(\mathbb{R})$ be the space of all locally
bounded real functions on $\mathbb{R}$, equipped with the topology of uniform convergence on compact
domains.

LEMMA 5.3. Assume (A1) and (A2) and let $0 < x_0 < \tau_H$. Suppose that (5.12), (5.14)
and (5.15) hold. Then the processes $\hat Z_n^\lambda$ and $\tilde Z_n^\lambda$ defined in (5.9) and (5.7) converge in
distribution to the process

(5.23)

$$\mathbb{Z}(x) = W\Big(\frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,x\Big) + \frac{1}{2}\lambda_0'(x_0)x^2,$$

in $B_{loc}(\mathbb{R})$, where $W$ is standard two-sided Brownian motion originating from zero.
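Proof. We will apply Theorem 4.7 in [12]. To this end, write the process in (5.9) as

(5.24)

$$\hat Z_n^\lambda(x) = -n^{2/3}\,\mathbb{P}_n g(\cdot\,; n^{-1/3}x) + n^{2/3}R_n(x),$$

for $x \in [-n^{1/3}(x_0 - T_{(1)}),\, n^{1/3}(T_{(n)} - x_0)]$, where for $Y = (T,\Delta,Z)$ and $\theta \in [-x_0, \tau_H - x_0]$,

(5.25)

$$g(Y,\theta) = -g_1(Y,\theta) + g_2(Y,\theta), \qquad
g_1(Y,\theta) = \big(\{T \le x_0+\theta\} - \{T \le x_0\}\big)\frac{\Delta}{\Phi(\beta_0,x_0)}, \qquad
g_2(Y,\theta) = \frac{\lambda_0(x_0)e^{\beta_0'Z}}{\Phi(\beta_0,x_0)}\int_{x_0}^{x_0+\theta}\{T \ge s\}\,ds.$$

Furthermore,

$$R_n(x) = -\frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\Big\{\big(\hat W_n(x_0+n^{-1/3}x) - W_{n0}(x_0+n^{-1/3}x)\big) - \big(\hat W_n(x_0) - W_{n0}(x_0)\big)\Big\},$$

where $W_{n0}(x) = W_n(\beta_0,x)$, with $W_n$ defined in (2.4). For all $k = 1,2,\ldots$, consider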



$$|R_n(x)| \le \frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)} \int \big|\{s \le x_0+n^{-1/3}x\} - \{s \le x_0\}\big|\,\big|\Phi_n(\hat\beta_n,s) - \Phi_n(\beta_0,s)\big|\,ds,$$

which by similar reasoning as in (4.12) gives that

(5.26)

$$|R_n(x)| = O_p(n^{-5/6}x),$$

by Lemma 4.1. Hence, the process $x \mapsto n^{2/3}R_n(x)$ tends to zero in $B_{loc}(\mathbb{R})$. It is sufficient
then to demonstrate that $-n^{2/3}\,\mathbb{P}_n g(\cdot, n^{-1/3}x)$ converges to $\mathbb{Z}(x)$ in $B_{loc}(\mathbb{R})$. To this end, we
will show that the conditions of Lemma 4.5 and 4.6 in [12] hold. Condition (i) of Lemma 4.5
is trivially fulfilled, since $\theta = 0$ is an interior point of $[-x_0, \tau_H - x_0]$. Moreover, observe that
for all $\theta \in [-x_0, \tau_H - x_0]$, from (2.9) and (3.1), we have

(5.27)



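$$Pg(\cdot,\theta) = -\frac{1}{\Phi(\beta_0,x_0)}\int_{x_0}^{x_0+\theta} \{\lambda_0(u) - \lambda_0(x_0)\}\Phi(\beta_0,u)\,du.$$

Thus, by (5.15) and (5.14),

$$\frac{\partial^2 Pg(\cdot,\theta)}{\partial\theta^2} = -\frac{\partial\Phi(\beta_0,x_0+\theta)}{\partial\theta}\,\frac{\lambda_0(x_0+\theta) - \lambda_0(x_0)}{\Phi(\beta_0,x_0)} - \frac{\Phi(\beta_0,x_0+\theta)}{\Phi(\beta_0,x_0)}\,\lambda_0'(x_0+\theta).$$

It follows that $Pg(\cdot,\theta)$ is twice differentiable at $\theta = 0$, its unique maximizing value, with
second derivative $-\lambda_0'(x_0) < 0$, which establishes condition (iii) of Lemma 4.5 in [12]. Next,
compute

$$H(s,t) = \lim_{\alpha\to\infty} \alpha\,Pg(\cdot,s/\alpha)g(\cdot,t/\alpha),$$

for finite $s$ and $t$. Write

$$\alpha\,Pg(\cdot,s/\alpha)g(\cdot,t/\alpha) = \alpha P\{-g_1(\cdot,s/\alpha) + g_2(\cdot,s/\alpha)\}\{-g_1(\cdot,t/\alpha) + g_2(\cdot,t/\alpha)\}$$

and compute the four terms separately. Notice that for all $s$ and $t$,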

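(5.28)

$$\alpha P|g_1(\cdot,s/\alpha)g_2(\cdot,t/\alpha)| \le \frac{\lambda_0(x_0)|t|}{\Phi^2(\beta_0,x_0)}\,P\Big[\big|\{T \le x_0+s/\alpha\} - \{T \le x_0\}\big|\,e^{\beta_0'Z}\Big] \to 0,$$

as $\alpha \to \infty$. Completely analogously, it follows that

(5.29)

$$\lim_{\alpha\to\infty} \alpha\,Pg_2(\cdot,s/\alpha)g_2(\cdot,t/\alpha) = 0,$$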

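for all $s$ and $t$. Furthermore, for $st < 0$, we have $Pg_1(\cdot,s/\alpha)g_1(\cdot,t/\alpha) = 0$. For $s,t > 0$,

$$\begin{aligned}
\alpha\,Pg_1(\cdot,s/\alpha)g_1(\cdot,t/\alpha) &= \frac{\alpha}{\Phi^2(\beta_0,x_0)}\int \delta\{x_0 < u \le x_0 + (s\wedge t)/\alpha\}\,dP(u,\delta,z) \\
&= \frac{\alpha}{\Phi^2(\beta_0,x_0)}\int_{x_0}^{x_0+(s\wedge t)/\alpha} \lambda_0(u)\Phi(\beta_0,u)\,du \\
&= \frac{1}{\Phi^2(\beta_0,x_0)}\int_0^{s\wedge t} \lambda_0(x_0+v/\alpha)\Phi(\beta_0,x_0+v/\alpha)\,dv,
\end{aligned}$$

by (2.9) and (3.1). Therefore, by the continuity of $\lambda_0$ and $\Phi$,

(5.30)

$$\lim_{\alpha\to\infty} \alpha\,Pg_1(\cdot,s/\alpha)g_1(\cdot,t/\alpha) = \frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,(s\wedge t).$$

A similar reasoning applies for $s,t < 0$. Hence, condition (ii) of Lemma 4.5 in [12] is verified,
with

$$H(s,t) = \frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,(|s|\wedge|t|),$$

for $st \ge 0$ and $H(s,t) = 0$, for $st < 0$. Note that $H(s,t)$ is the covariance kernel of the
centered Gaussian process in (5.23). For condition (iv) of Lemma 4.5 in [12], it needs to be
shown that for each $t$ and $\varepsilon > 0$,

(5.31)

$$\lim_{\alpha\to\infty} \alpha\,Pg(\cdot,t/\alpha)^2\{|g(\cdot,t/\alpha)| > \alpha\varepsilon\} = 0.$$

In view of (5.28) and (5.29), it suffices to show that

$$\lim_{\alpha\to\infty} \alpha\,Pg_1(\cdot,t/\alpha)^2\{|g(\cdot,t/\alpha)| > \alpha\varepsilon\} = 0.$$

Moreover, since $g_1$ is bounded uniformly for $\theta \in [-x_0, \tau_H - x_0]$, by Lemma 4.1,

$$\{|g(\cdot,t/\alpha)| > \alpha\varepsilon\} \le \{|g_2(\cdot,t/\alpha)| > \alpha\varepsilon/2\} \le \frac{2}{\alpha\varepsilon}\,|g_2(\cdot,t/\alpha)|,$$

for $\alpha$ sufficiently large. By (5.28), it follows that

$$\alpha\,Pg_1(\cdot,t/\alpha)^2\{|g(\cdot,t/\alpha)| > \alpha\varepsilon\} \le \frac{2}{\varepsilon}\,Pg_1(\cdot,t/\alpha)^2|g_2(\cdot,t/\alpha)| \le \frac{2}{\varepsilon\,\Phi(\beta_0,M)}\,P|g_1(\cdot,t/\alpha)g_2(\cdot,t/\alpha)| \to 0.$$

Hence all conditions of Lemma 4.5 in [12] are satisfied.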

To continue with verifying the conditions of Lemma 4.6 in [12], consider the class of
functions $\mathcal{G} = \{g(\cdot,\theta) : \theta \in [-x_0, \tau_H - x_0]\}$ and the classes

(5.32)

$$\mathcal{G}_R = \{g(\cdot,\theta) \in \mathcal{G} : |\theta| \le R\},$$

for any $R > 0$, $R$ in a neighborhood of zero. Since the functions in $\mathcal{G}_R$ are the sum of
a bounded monotone function and the product of strictly positive monotone functions, it
follows that $\mathcal{G}_R$ is a VC-subgraph class of functions, and hence it is uniformly manageable,
which proves condition (i) of Lemma 4.6 in [12]. Furthermore, choose as an envelope for $\mathcal{G}_R$,

(5.33)

$$G_R = G_{R1} + G_{R2},$$

where

(5.34)



Gr,i{T,A,Z) 
Gffi(T,A,Z) 



{xo-R<T <xo + R} 



Calculations completely analogous to (5.28), (5.29) and (5.30), with $1/R$ playing the role of
$\alpha \to \infty$, yield that $PG_R^2 = O(R)$, as $R \to 0$. This proves condition (ii) of Lemma 4.6 in [12].
To show condition (iii) of Lemma 4.6 in [12], first note that

$$P|g(\cdot,\theta_1) - g(\cdot,\theta_2)| \le P|g_1(\cdot,\theta_1) - g_1(\cdot,\theta_2)| + P|g_2(\cdot,\theta_1) - g_2(\cdot,\theta_2)|.$$

Now,

$$P|g_1(\cdot,\theta_1) - g_1(\cdot,\theta_2)| = \frac{1}{\Phi(\beta_0,x_0)}\,\big|H^{uc}(x_0+\theta_1) - H^{uc}(x_0+\theta_2)\big| = O(|\theta_1 - \theta_2|),$$

according to (5.12). Analogously,

$$P|g_2(\cdot,\theta_1) - g_2(\cdot,\theta_2)| \le \frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,P\big[e^{\beta_0'Z}\big]\,|\theta_1 - \theta_2| = O(|\theta_1 - \theta_2|),$$

by (A2), which proves condition (iii) of Lemma 4.6 in [12]. Finally, to establish condition (iv)
of Lemma 4.6 in [12], we have to show that for each $\varepsilon > 0$, there exists $K > 0$ such that

$$PG_R^2\{G_R > K\} \le \varepsilon R,$$

for $R$ near zero. The proof of this is completely analogous to proving (5.31), with $1/R$ playing
the role of $\alpha \to \infty$. This shows that all conditions of Theorem 4.7 in [12] are fulfilled, from which
we conclude that the process $-n^{2/3}\,\mathbb{P}_n g(\cdot, n^{-1/3}x)$ converges in distribution to the process $\mathbb{Z}$
defined in (5.23). Together with (5.24) and (5.26), this proves the weak convergence of $\hat Z_n^\lambda$. Weak convergence
of $\tilde Z_n^\lambda$ is then immediate, by Lemma 5.2. □



As a consequence, we obtain the limiting distribution of the process in (5.8).

LEMMA 5.4. Assume (A1) and (A2) and suppose that (5.12), (5.14) and (5.15) hold. Let
$0 < x_0 < \tau_H$ and $a > 0$ fixed and let $\hat Z_n^\lambda$ and $\hat W_n$ be defined in (5.9) and (4.14). Then, the
process

$$\hat Z_n^\lambda(x) - \frac{n^{1/3}a}{\Phi(\beta_0,x_0)}\big\{\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big\}$$

converges weakly to

$$\mathbb{Z}(x) - ax = W\Big(\frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,x\Big) + \frac{1}{2}\lambda_0'(x_0)x^2 - ax,$$

in $B_{loc}(\mathbb{R})$, where $W$ is standard two-sided Brownian motion originating from zero.



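Proof. In view of Lemma 5.3, it suffices to show that for any $k = 1,2,\ldots$,

(5.35)

$$\sup_{|x|\le k} \Big| n^{1/3}\big\{\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big\} - \Phi(\beta_0,x_0)x \Big| \to 0,$$

almost surely. This is immediate, since similar to (4.20), together with the monotonicity
of $\Phi(\beta_0,u)$, one has

(5.36)

$$\begin{aligned}
\Big| n^{1/3}\int_{x_0}^{x_0+n^{-1/3}x} \{\Phi_n(\hat\beta_n,u) - \Phi(\beta_0,x_0)\}\,du \Big|
&\le |x| \sup_{u} |\Phi_n(\hat\beta_n,u) - \Phi(\beta_0,u)| + |x|\,\big|\Phi(\beta_0,x_0+n^{-1/3}x) - \Phi(\beta_0,x_0)\big| \\
&= o(x) + O(n^{-1/3}x),
\end{aligned}$$

almost surely, using Lemma 4.2 and (5.15). □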



Finally, the next lemma provides the limit process of $\tilde Z_n^f$.

LEMMA 5.5. Assume (A1) and (A2). Let $x_0 \in (0,\tau_H)$ and suppose that (5.12), (5.14)
and (5.15) hold. Then the process $\tilde Z_n^f$ defined in (5.11) converges in distribution to the process

(5.37)

$$\mathbb{Z}^f(x) = W\Big(\frac{f_0(x_0)(1-F_0(x_0))}{\Phi(\beta_0,x_0)}\,x\Big) + \frac{1}{2}f_0'(x_0)x^2,$$

in $B_{loc}(\mathbb{R})$, where $W$ is standard two-sided Brownian motion originating from zero.

Proof. From (5.7), we have A„(xo + n^^^^x) — A„(xo) = n^^/^Z^(x) + n^^/^Ao(xo)x, so that 

by (ra, 



(5.38) 



n 



2/3 

2/3 I . 



-A„(a;o+n-l/3a::) 



-A„(a:o) 



'/'/o(xo)x} 



_g-A„(a;o) /g-n-2/3z^(x)-n-i/3Ao(xo)x _ 



n 



"'/Vo(:ro)x} 



Because e ^ — 1 = — ?/ + + o(y^), for y — )• and sup^jgi^ |Z^(j;)| = Op(l), according 



to Lemma 5.3 it follows that 



-n-2/3z^(x) - n-^/^Xo{xo)x + ^n-^/^Xo{xofx^ 



Similarly, from Theorem 



4.1 



+ Op{n~^/^) + Opin-^x) + Op(n-2/3a;2). 
we have that e'^^^^'o) = e-'^o(^o) + Opin-^/"^). Since 



-^o("«)Ao(xo) = (1 - Fo(xo))Ao(xo) = /o(xo). 



from (5.38), we find that 
(5.39) 



Zi{x) = (1 - Fo(xo))Z^(x) - -(1 - Fo(xo))Ao(xo) V 



According to Lemma 5.3, the process (1 — Fo(xo))Z^(x) — ^(1 — Fo(xo))Ao(xo)^x^ converges 
weakly to 



(1 - Fo(xo))W 



^\f^'lf ) + - Fo{xo))Uxo)x' - ^(1 - Fo(xo))A2(xo)x2, 



which has the same distribution as the process in (5.37), by means of Brownian scaling and 
the fact that 



(5.40) 



A' 



/o V ^ (l-i"o)/o + /o^ ^ , .2 

l-FoJ (l-Fo)2 1-Fo^ 



Hence, for any A; = 1, 2, . . ., it follows from (5.39) that 

sup |Z{(X)-Z^(X)| =Op(l), 
|x|<A: 



which finishes the proof. □

6 Limit distribution

The last step in deriving the asymptotic distribution of the estimators is to find the limiting
distribution of the inverse processes $\hat U_n^\lambda$, $\tilde U_n^\lambda$ and $\tilde U_n^f$ defined in (5.1), (5.3) and (5.4), and of
the versions of $\hat U_n^\lambda$ and $\tilde U_n^\lambda$ in the case of a nonincreasing hazard, by applying Theorem 2.7
in [12]. This requires the inverse processes to be bounded in probability.

LEMMA 6.1. Assume (A1) and (A2) and let $x_0 \in (0,\tau_H)$. Suppose that $\lambda_0$ is monotone
and suppose that $f_0$ is nonincreasing. Suppose that (5.14) and (5.15) hold, with $\lambda_0'(x_0) \ne 0$.
Then, for each $\varepsilon > 0$ and $M_1 > 0$, there exists $M_2 > 0$ such that

(6.1)

$$P\Big(\max_{|a|\le M_1} n^{1/3}\big|\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big| > M_2\Big) < \varepsilon,$$

(6.2)

$$P\Big(\max_{|a|\le M_1} n^{1/3}\big|\tilde U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big| > M_2\Big) < \varepsilon,$$

(6.3)

$$P\Big(\max_{|a|\le M_1} n^{1/3}\big|\tilde U_n^f(f_0(x_0) + n^{-1/3}a) - x_0\big| > M_2\Big) < \varepsilon,$$

for $n$ sufficiently large.

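Proof. The proof of the lemma follows closely the lines of proof of Lemma 5.3 in [9] (see also
Lemma 7.1 in [10]). First consider (6.1) in case $\lambda_0$ is nondecreasing. It will be shown that

(6.4)

$$P\Big(\max_{|a|\le M_1} n^{1/3}\big\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big\} > M_2\Big) < \varepsilon,$$

as the other part can be proved similarly. Because $\hat U_n^\lambda(a)$ is nondecreasing, the probability
in (6.4) is equal to

$$P\Big(n^{1/3}\big\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}M_1) - x_0\big\} > M_2\Big).$$

The relationship between the inverse process and the process $\hat Z_n^\lambda$ defined in (5.9), together
with the fact that $\hat Z_n^\lambda(0) = 0$, implies that this probability is bounded by

(6.5)

$$P\Big(\hat Z_n^\lambda(x) - \frac{n^{1/3}M_1}{\Phi(\beta_0,x_0)}\big\{\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big\} \le 0, \text{ for some } x \ge M_2\Big).$$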



By condition (5.14), there exists $M_0 > 0$ such that, for any $x \in [T_{(1)},T_{(n)}]$ with $|x - x_0| \le M_0$,
$\lambda_0'(x) > 0$ and $\lambda_0'(x)$ is close to $\lambda_0'(x_0)$. Take $n^{-1/3}x \le M_0$. From (5.24) and (5.35), note that

(6.6)

$$\hat Z_n^\lambda(x) - \frac{n^{1/3}M_1}{\Phi(\beta_0,x_0)}\big\{\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big\} = -n^{2/3}\,\mathbb{P}_n g(\cdot, n^{-1/3}x) - M_1x + R_n(x),$$

where $R_n(x) = O_p(n^{-1/6}x) + o(x) + O(n^{-1/3}x)$, by (5.26) and (5.36). Furthermore, for
$0 < R \le M_0$, consider the class of functions $\mathcal{G}_R$ defined in (5.32) along with its envelope $G_R$
defined in (5.33). It has been determined in the proof of Lemma 5.3 that $\mathcal{G}_R$ is uniformly
manageable for its envelope $G_R$ and that $PG_R^2 = O(R)$, for $0 < R \le M_0$. Thus, Lemma 4.1
in [12] states that for each $\delta > 0$, there exist random variables $S_n = O_p(1)$ such that

(6.7)

$$|\mathbb{P}_n g(\cdot, n^{-1/3}x) - Pg(\cdot, n^{-1/3}x)| \le \delta n^{-2/3}x^2 + n^{-2/3}S_n^2,$$

for $n^{-1/3}x \le M_0$. Choose $\delta = \lambda_0'(x_0)/8$ in the above inequality. It will result that



n2/3(F„ - P)g{., n-V^^) > -9A'o(xo)x^ - 



'1/3 



Furthermore, by (5.14), (5.15) and (5.27) 

^2 / 



n^/^Pg{-,n-'^''^x) 



1/3, 



(6.8) 



2$(/3o,xo) 



Ao(xo + en)$(/3o,Xo+0n) 



+ {Xoixo + On) - Xo{xo)} ^'(/3o, XQ + 9„ 



for < n ^/^x < Mo, where $'(/3o,x) = d^{f3Q, x) /dx. From the choice of Mq and 
since Aq(xo) > 0, we can find K > such that for any x > K, 

-r?/^Pg{-,n-^''^x) - Mix > ^Xq{xo)x^, 

for n sufficiently large. We conclude that 
,1 



{ty„(xo + n-i/3x)- W^„(xo)} 
= -n2/3F„5(., - Mix + i?„(t) 

= -r?/^{¥n - P)g{-,n-^/^x) - 7?''^Pg{-,n-^/^x) - Mix + Rn{x) 

> h'oixo)x''-Sl + Rn{x), 

where Rn{x) = Op{n^'^^^x) + o{x) + 0{'nr'^^^x) and the Op, O and o terms do not depend on 
x. It follows that for x > M2 > K, 



n 



I^Mi 



{t?„(xo + n-^l\) - Wn{xo)] > lXo{xo)x^ -Sl + op(l), 



where the op term does not depend on x. Then, M2 can be chosen such that 

F(sl>lx'o{xo)Mi + op{l)) <e, 



for n sufficiently large. We find that 



^nix) - ^ 1 Wn{xo + n-^/^x) - Wn{xo) \ < 0, for some M2 < x < u^/^Mq 

^\PQ,Xq) L J J 

< F Qao(xo)x^ -Sl + op(l) < 0, for some M2 < x < v}'^M^ 

< F (^Sl > ^Xq{xo)x'^ + op(l), for some M2 < x < n^/^Mo^ < e, 
for $n$ sufficiently large.

For $n^{-1/3}x > M_0$, we first show that



(6.f0) 



Wn{xo + n-^/^x)-Wn(xo)} 



{Wn{xo+Mo/2)-Wn{xo)}, 



with large probability, for n sufficiently large. Then, 



{Wn{xo + Mo/2) - Wn{xo)^ < 



can be bounded with the argument above. Lemma 4.3 and (4.16) yield that Vn{xo + Mq/2) = 
Vn{xo+Mo/2)+o{l), with probability one and by definition Vn{xQ+n~^/^x) > Vn{xQ+n~^^^x), 
for all xq + n^^^^x € [T(^i), ^(n)]- This implies that 



(6.11) 



Vnixo + n-^/^x) - Vnixo + Mo/2) 

> Vnixo + n-^/^x) - Vnixo + Mo/2) + o(l), 

> A„(xo + Mo/2) (Wnixo + n-^'^x) - Wnixo + Mo/2)) + o(l). 



using the convexity of Vn- To show (6.10), note that by definition (5.9), 



<{x) 



n 



-/3Mi 



%lin^l'-Mo/2) 



^nixo^n-^l^x)-Wnihn.Xo)\ 



n 



$(^o,2;o) 



{vF„(xo + Mo/2)-t?„(xo)} 



n 



2/3 



$(/3o,a;o) 



Vnixo + n-^l^x) - Vnixo + Mo/2) 



> 



n 



2/3 



^(/3o,2;o) 



2/3 



Ao(xo) + n-^I^M^ ^nixo + n-^l^x) - Wnixo + Mo/2)} 



\nixo + Mo/2) - Ao(xo) - n-^l'^M^ 



\Wnixo 



+ n-^l^x) - Wnixo + Mo/2)} + o(l) 



M ,Ao(xo + Mo/2)-Ao(xo)-n-i/3^i+o(l)| 
3'(Po,xo) y 1^ J 

X \Woixo + n-i/3^) - VFo(xo + Mo/2) + o(l)} + o(l) J > 0, 



for $n$ sufficiently large, using (4.20) and the fact that $\lambda_0$ and $W_0$ are strictly increasing and
$n^{-1/3}x > M_0$. It follows that



n 



^>(/3o,a;o) 
< F ( Z^(ni/3Mo/2 



^(/3o,2;o) 



^n(a;o)| < 0, for some x > n^^'^Mo 

Wn{xo + Mo/2)-Wn{xo)] <0] < E. 



This completes the proof of (6.4). The other part of (6.1) for a nondecreasing $\lambda_0$ is proven
similarly.

For (6.2), in case of a nondecreasing $\lambda_0$, by the same reasoning that leads to (6.5) we first
have

$$P\Big(n^{1/3}\big\{\tilde U_n^\lambda(\lambda_0(x_0) + n^{-1/3}M_1) - x_0\big\} > M_2\Big) \le P\Big(\tilde Z_n^\lambda(x) - M_1x \le 0, \text{ for some } x \ge M_2\Big).$$



Moreover, by (5.22), 



where the Op terms do not depend on x. Similar to (6.9), one obtains 



- Mix > ^A'o(xo)x2 -Sl + Op{l), 
for M2 < X < n^/^Mo, where the Op-term does not depend on x, which yields 
F (^Z^(x) - Mix < 0, for some M2 < x < n^/^Mo) < e. 



In the case x > n^/^Mo, similar to (6.11), Theorem 4.1 and (4.25) yield 
(6.12) 

This leads to 



A„(xo + n-^/^x) - A„(xo + Mo/2) > A„(xo + n-^^^x) - A„(xo + Mo/2) + o(l) 

> An(xo + Mo/2)(n-i/=^x - Mo/2) + o(l). 



Z^(x) - Mix > Z^(ni/=^Mo/2) - 
from which we conclude 

F (^Z^(x) - Mix < 0, for some x > n^/^Mo) < e. 



This completes one part of the proof of (6.2) for a nondecreasing $\lambda_0$. The other part is shown
similarly.

For (6.3), using that $\tilde U_n^f$ is nonincreasing, similar to (6.5), we first have

$$P\Big(n^{1/3}\big\{\tilde U_n^f(f_0(x_0) + n^{-1/3}M_1) - x_0\big\} > M_2\Big) \le P\Big(\tilde Z_n^f(x) - M_1x \ge 0, \text{ for some } x \ge M_2\Big).$$



Next, according to (5.39), (5.22) and (5.36), we obtain 



Z^(x) - Mix = -(1 - Fo(xo))n2/3(p„ - p)g{., n-^/'^x) 

- (1 - Fo(xo))n2/3pg(.,n-i/32.) - 1(1 - Fo(xo))Ao(xo) V - Mix 
+ Op(n-i/3) + Op(n-V2a.i/2) + ^^(3.) + ^^(x^). 
where the $O_p$ and $o_p$ terms do not depend on $x$ and where $Pg(\cdot, n^{-1/3}x)$ is given in (6.8).

Now, choose $\delta = -f_0'(x_0)/(8(1 - F_0(x_0))) > 0$ in (6.7), so that according to Lemma 4.1 in [12],
-(1 - Fo(xo))n2/3(P„ - P)g{.,n~y^x) < -^/^(xo)^^ + Sl 



for n ^/^x < Mq and = Op{l). Furthermore, from (6.8) together with (5.40), it follows 
that we can find a K > such that for any x > K, 

-(1 - Fo(xo))n2/3p5(.,n-i/32.) _ ^(i _ Fo(xo))Ao(xo) V - M^x < ^/^(xo)a;2. 



for n sufficiently large. Similar to (6.9) we have for x > M2 > K 



Ziix) - Mix < Q/^(xo) + Op{l)^ x^ + Sl + Op{l), 

where the Op terms do not depend on x, which leads to 

F (^^i{x) - Mix > 0, for some M2 < x < n^/^Mo) < s, 

for n sufficiently large. In the case x > u^^^Mq, first, similar to (4.25), we can obtain that for 
any < M < th, 



sup 

xe[o,Af] 



Fn{x)-Fo{x) < sup \Fn{x) - Ao{x)\ , 

xelo,M] 



which then similar to (6.12) together with Corollary 4.1 yields 

Fnixo + n-^/^x) - Fnixo + Mo/2) < Fn{xo + n-^/^x) - F„(xo + Mo/2) + o(l) 



(6.13) 
This leads to 



< fn{xo + Mo/2){n-^/''x - Mo/2) + o(l). 



ZJ^(x) - Mix < 'Zl{n^/^Mo/2) - Min^l^Mo/2, 
from which we conclude 



P {^^{x) - Mix > 0, for some x > n^/^Mo) < e. 



This completes one part of the proof of (6.3). The other part is shown similarly.

Finally, the proof of (6.1) and (6.2) in the case of a nonincreasing $\lambda_0$ is along the lines
of the proof of (6.3), combined with arguments used for the proof of (6.1) and (6.2) in the
nondecreasing case. □

Hereafter, the continuous mapping theorem from [12] will be applied to the inverse processes
in (5.1), (5.3) and (5.4), in order to derive the limiting distribution of the considered
estimators. Let $C_{\max}(\mathbb{R})$ denote the subset of $B_{loc}(\mathbb{R})$ consisting of continuous functions $f$
for which $f(t) \to -\infty$, when $|t| \to \infty$, and $f$ has a unique maximum.

Proof of Theorem 3.2. The aim is to apply Theorem 2.7 in [12] and Theorem 6.1 in [10].
Since Theorem 2.7 from [12] applies to the argmax of processes on the whole real line, we
extend the process

$$\hat Z_n^\lambda(a,x) = \hat Z_n^\lambda(x) - \frac{n^{1/3}a}{\Phi(\beta_0,x_0)}\big\{\hat W_n(x_0 + n^{-1/3}x) - \hat W_n(x_0)\big\}$$

from (5.8) for $x \in [n^{1/3}(T_{(1)} - x_0),\, n^{1/3}(T_{(n)} - x_0)]$, to the whole real line. Define $\hat Z_n^\lambda(a,x) =
\hat Z_n^\lambda(a, n^{1/3}(T_{(1)} - x_0))$, for $x < n^{1/3}(T_{(1)} - x_0)$ and $\hat Z_n^\lambda(a,x) = \hat Z_n^\lambda(a, n^{1/3}(T_{(n)} - x_0)) + 1$, for
$x > n^{1/3}(T_{(n)} - x_0)$. Then, $\hat Z_n^\lambda(a,x) \in B_{loc}(\mathbb{R})$ and according to (5.8),

$$n^{1/3}\big\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big\} = \operatorname*{argmin}_{x\in\mathbb{R}} \{\hat Z_n^\lambda(a,x)\} = \operatorname*{argmax}_{x\in\mathbb{R}} \{-\hat Z_n^\lambda(a,x)\}.$$



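By Lemma 5.4, for any $a$ fixed, the process $-\hat Z_n^\lambda(a,x)$ converges weakly to the process
$-\mathbb{Z}(x) + ax \in C_{\max}(\mathbb{R})$ with probability one, where $\mathbb{Z}$ has been defined in (5.23). Lemma 6.1
ensures the boundedness in probability of $n^{1/3}\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\}$. Consequently, by
Theorem 2.7 in [12] it follows that

$$n^{1/3}\big\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big\} \xrightarrow{d} \operatorname*{argmax}_{x\in\mathbb{R}} \{-\mathbb{Z}(x) + ax\} = \operatorname*{argmin}_{x\in\mathbb{R}} \{\mathbb{Z}(x) - ax\}.$$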



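The same argument applies to the process $\tilde Z_n^\lambda(x) - ax$ from (5.6), for $x \in [-n^{1/3}x_0,\, n^{1/3}(T_{(n)} - x_0)]$,
which we extend to the whole real line in a similar fashion. Furthermore, if we fix
$a, b \in \mathbb{R}$, it will follow that

$$\big(\hat Z_n^\lambda(a,x),\, \tilde Z_n^\lambda(x) - bx\big) \xrightarrow{d} \big(\mathbb{Z}(x) - ax,\, \mathbb{Z}(x) - bx\big),$$

by Lemma 5.4 and Lemma 5.3. Hence, the first condition of Theorem 6.1 in [10] is verified.
The second condition is provided by Lemma 6.1, whereas the third condition is given by (5.6)
and (5.8). Therefore, by Theorem 6.1 in [10],

$$\Big(n^{1/3}\big\{\hat U_n^\lambda(\lambda_0(x_0) + n^{-1/3}a) - x_0\big\},\; n^{1/3}\big\{\tilde U_n^\lambda(\lambda_0(x_0) + n^{-1/3}b) - x_0\big\}\Big) \xrightarrow{d} \big(U^\lambda(a), U^\lambda(b)\big),$$

where

$$U^\lambda(a) = \sup\Big\{t : W\Big(\frac{\lambda_0(x_0)}{\Phi(\beta_0,x_0)}\,t\Big) + \frac{1}{2}\lambda_0'(x_0)t^2 - at \text{ is minimal}\Big\}.$$

Additional computations show that $U^\lambda(a) = U^\lambda(0) + a/\lambda_0'(x_0)$ and therefore, by the definition
of the inverse processes in (5.1) and (5.3),

$$\begin{aligned}
&P\Big(n^{1/3}\{\hat\lambda_n(x_0) - \lambda_0(x_0)\} > a,\; n^{1/3}\{\tilde\lambda_n(x_0) - \lambda_0(x_0)\} > b\Big) \\
&\qquad \to P\big(U^\lambda(a) < 0,\, U^\lambda(b) < 0\big) = P\big(-\lambda_0'(x_0)U^\lambda(0) > a,\, -\lambda_0'(x_0)U^\lambda(0) > b\big),
\end{aligned}$$

as $n \to \infty$. This implies that

$$\Big(n^{1/3}\{\hat\lambda_n(x_0) - \lambda_0(x_0)\},\; n^{1/3}\{\tilde\lambda_n(x_0) - \lambda_0(x_0)\}\Big) \xrightarrow{d} \big(-\lambda_0'(x_0)U^\lambda(0),\, -\lambda_0'(x_0)U^\lambda(0)\big),$$

which proves (3.3). To establish the limiting distribution, define

$$A(x) = \Big(\frac{\Phi(\beta_0,x)}{4\lambda_0(x)\lambda_0'(x)}\Big)^{1/3}$$

and note that

$$n^{1/3}A(x_0)\{\hat\lambda_n(x_0) - \lambda_0(x_0)\} \xrightarrow{d} A(x_0)\lambda_0'(x_0)U^\lambda(0) \overset{d}{=} \operatorname*{argmin}_{t\in\mathbb{R}} \{W(t) + t^2\},$$

by Brownian scaling and the fact that the distribution of $U^\lambda(0)$ is symmetric around zero. □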
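
The limit variable $\operatorname{argmin}_t\{W(t) + t^2\}$ has no elementary closed form, but it is easy to approximate by simulation. The following is a rough Monte Carlo sketch, assuming a random-walk approximation of two-sided Brownian motion on a truncated grid; the function name, the truncation width and the step size are illustrative choices, not part of the paper.

```python
import numpy as np

def simulate_argmin(n_rep=400, half_width=3.0, dt=1e-3, seed=0):
    """Approximate draws of argmin_t {W(t) + t^2} on [-half_width, half_width]."""
    rng = np.random.default_rng(seed)
    m = int(round(half_width / dt))
    t_pos = dt * np.arange(1, m + 1)
    t = np.concatenate([-t_pos[::-1], [0.0], t_pos])
    out = np.empty(n_rep)
    for r in range(n_rep):
        # Two independent random walks give the two sides of W, with W(0) = 0.
        w_pos = np.cumsum(rng.normal(0.0, np.sqrt(dt), m))
        w_neg = np.cumsum(rng.normal(0.0, np.sqrt(dt), m))
        w = np.concatenate([w_neg[::-1], [0.0], w_pos])
        out[r] = t[np.argmin(w + t ** 2)]
    return out
```

Empirical quantiles of the returned draws could then be combined with a plug-in estimate of $A(x_0)$ to form approximate pointwise confidence intervals; the truncation is harmless in practice because the quadratic drift makes minimizers far from zero very unlikely.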



Proof of Theorem 3.3. The proof of Theorem 3.3 is completely analogous to that of Theorem 3.2.
The inverse processes to be considered in this case are

$$\hat U_n^\lambda(a) = \operatorname*{argmax}_{x\in[0,T_{(n)}]} \{Y_n(x) - aW_n(\hat\beta_n,x)\}, \qquad
\tilde U_n^\lambda(a) = \operatorname*{argmax}_{x\in[0,T_{(n)}]} \{\Lambda_n(x) - ax\},$$

for $a > 0$, where $W_n$, $Y_n$ and $\Lambda_n$ have been defined in (2.4), (2.11) and (2.8) and $\hat\beta_n$ is
the maximum partial likelihood estimator. By the same arguments as used in the proof of
Theorem 3.2, the limiting distribution is expressed in terms of

$$\operatorname*{argmax}_{t\in\mathbb{R}} \{W(t) - t^2\} \overset{d}{=} \operatorname*{argmax}_{t\in\mathbb{R}} \{-W(t) - t^2\} = \operatorname*{argmin}_{t\in\mathbb{R}} \{W(t) + t^2\},$$

by properties of Brownian motion. □

Proof of Theorem 3.4. Completely similar to the reasoning in the proof of Theorem 3.2 we
obtain

$$n^{1/3}\big\{\tilde U_n^f(f_0(x_0) + n^{-1/3}a) - x_0\big\} \xrightarrow{d} U^f(a),$$

where

$$U^f(a) = \sup\Big\{t : W\Big(\frac{f_0(x_0)(1-F_0(x_0))}{\Phi(\beta_0,x_0)}\,t\Big) + \frac{1}{2}f_0'(x_0)t^2 - at \text{ is maximal}\Big\}.$$

As before, by Brownian scaling, $U^f(a) = U^f(0) + a/f_0'(x_0)$ and together with (5.5) we obtain

$$P\Big(n^{1/3}\{\tilde f_n(x_0) - f_0(x_0)\} < a\Big) \to P\big(-f_0'(x_0)U^f(0) < a\big).$$

Similar to the proof of Theorem 3.2, with

$$A(x) = \Big(\frac{\Phi(\beta_0,x)}{4f_0(x)f_0'(x)(1-F_0(x))}\Big)^{1/3},$$

we conclude that $n^{1/3}A(x_0)\{\tilde f_n(x_0) - f_0(x_0)\}$ converges in distribution to

$$A(x_0)f_0'(x_0)U^f(0) \overset{d}{=} \operatorname*{argmax}_{t\in\mathbb{R}} \{W(t) - t^2\} \overset{d}{=} \operatorname*{argmin}_{t\in\mathbb{R}} \{W(t) + t^2\},$$

using Brownian scaling and the fact that the distribution of $U^f(0)$ is symmetric around
zero. □